Podcast
Questions and Answers
Explain the difference between categorical and dimensional representations of emotions.
Explain the difference between categorical and dimensional representations of emotions.
In a categorical representation, emotions are viewed as distinct categories (e.g., happiness, sadness). In a dimensional representation, emotions are plotted on a spectrum based on dimensions like valence, arousal, and dominance.
According to Paul Ekman's research, how many universal facial expressions of emotions are there?
According to Paul Ekman's research, how many universal facial expressions of emotions are there?
Paul Ekman's research identifies seven universal facial expressions of emotions.
Describe the VAD model and what each dimension represents.
Describe the VAD model and what each dimension represents.
The VAD model has three dimensions: Valence (how positive or negative the emotion is), Arousal (the intensity of the emotion), and Dominance (the level of control).
In the context of sentiment analysis, how can the Euclidean distance be used?
In the context of sentiment analysis, how can the Euclidean distance be used?
What is sentiment analysis, and how is it typically used?
What is sentiment analysis, and how is it typically used?
Explain the difference between dictionary-based and machine learning-based sentiment analysis.
Explain the difference between dictionary-based and machine learning-based sentiment analysis.
How is the overall sentiment score calculated in dictionary-based sentiment analysis?
How is the overall sentiment score calculated in dictionary-based sentiment analysis?
What is the crowdsourcing and how it is related to sentiment analysis?
What is the crowdsourcing and how it is related to sentiment analysis?
What is VADER, and what are some heuristics it considers?
What is VADER, and what are some heuristics it considers?
In the context of sentiment analysis, how can a dictionary-based approach be adopted using categorical emotion representation?
In the context of sentiment analysis, how can a dictionary-based approach be adopted using categorical emotion representation?
What do you need to do, to use machine-learning for sentiment analysis?
What do you need to do, to use machine-learning for sentiment analysis?
In what ways do "Bag of Words (BoW)" affect machine learning?
In what ways do "Bag of Words (BoW)" affect machine learning?
Describe the limitations of using the Bag of Words (BoW) model in feature extraction for sentiment analysis.
Describe the limitations of using the Bag of Words (BoW) model in feature extraction for sentiment analysis.
Summarize the key goals of using word embeddings in feature extraction.
Summarize the key goals of using word embeddings in feature extraction.
What information is being represented, for the following:
[King] – [man] + [woman] = ?
What information is being represented, for the following: [King] – [man] + [woman] = ?
Who was John Rupert Firth, and what was his contribution to sentiment analysis?
Who was John Rupert Firth, and what was his contribution to sentiment analysis?
What is the role of cosine similarity in the context of word embeddings, and how is it interpreted?
What is the role of cosine similarity in the context of word embeddings, and how is it interpreted?
Define the role of baseline regression as related to machine learning.
Define the role of baseline regression as related to machine learning.
In the equation,
y = f(w•x) + b
What do each of the terms describe?
In the equation, y = f(w•x) + b What do each of the terms describe?
Explain how a sigmoid function is used to compute sentiment.
Explain how a sigmoid function is used to compute sentiment.
Describe two possible types of data that an input feature vector can represent when training a regression model for sentiment analysis.
Describe two possible types of data that an input feature vector can represent when training a regression model for sentiment analysis.
What is the relationship between 'Happy' and 'Unhappy'. Further, where are each of their roots from, if any?
What is the relationship between 'Happy' and 'Unhappy'. Further, where are each of their roots from, if any?
What is a language model (LM), and what are they used for in NLP?
What is a language model (LM), and what are they used for in NLP?
What are some applications of sentiment analysis and how can businesses benefit from it?
What are some applications of sentiment analysis and how can businesses benefit from it?
How do sentiment analysis tools handle sarcasm and other forms of figurative language, and why can this be a challenge?
How do sentiment analysis tools handle sarcasm and other forms of figurative language, and why can this be a challenge?
What are some ethical considerations when using sentiment analysis, particularly in areas like political polling or customer feedback?
What are some ethical considerations when using sentiment analysis, particularly in areas like political polling or customer feedback?
What are some common preprocessing steps used on text data to improve machine learning model performance?
What are some common preprocessing steps used on text data to improve machine learning model performance?
How does self-supervised learning work?
How does self-supervised learning work?
Describe a time that required communications with more than one person. Then, label the context (i.e. bump into a friend at a bus stop, etc), the content, the environment and atmosphere, and briefly denote the activities performed.
Describe a time that required communications with more than one person. Then, label the context (i.e. bump into a friend at a bus stop, etc), the content, the environment and atmosphere, and briefly denote the activities performed.
How could you use chatGPT to measure your communications?
How could you use chatGPT to measure your communications?
Describe the first step of using machine learnign alorithms in natural language processiing
Describe the first step of using machine learnign alorithms in natural language processiing
What are two common challenges for applying for machine learning algorithms for language.
What are two common challenges for applying for machine learning algorithms for language.
Describe one algorithm or data structure that reduces the computational overhead of applying machine learning models to natural language.
Describe one algorithm or data structure that reduces the computational overhead of applying machine learning models to natural language.
Describe some examples that would be key metrics to understanding emotional states of a person
Describe some examples that would be key metrics to understanding emotional states of a person
How would you define "semantic understanding of language"?
How would you define "semantic understanding of language"?
How does a natural language processing model understand that "excellent customer service" means the same thing as "amazing customer service"?
How does a natural language processing model understand that "excellent customer service" means the same thing as "amazing customer service"?
When classifying data could there be biases? List out possible ways to check for these biases
When classifying data could there be biases? List out possible ways to check for these biases
When attempting to test out natural language processing frameworks, should you load balance all the data from many source, or slowly load the data from many sources?
When attempting to test out natural language processing frameworks, should you load balance all the data from many source, or slowly load the data from many sources?
Flashcards
What are emotions?
What are emotions?
The effect of chemical messages released by our brain.
Categorical emotions
Categorical emotions
Representations that classify emotions into distinct groups, such as happiness, sadness, anger, fear, surprise, and disgust.
Who is Paul Ekman?
Who is Paul Ekman?
Paul Ekman was a pioneer in studying emotions and their relation to facial expressions.
Dimensional representation of emotions?
Dimensional representation of emotions?
Signup and view all the flashcards
Define Valence
Define Valence
Signup and view all the flashcards
Define Arousal
Define Arousal
Signup and view all the flashcards
Define Dominance
Define Dominance
Signup and view all the flashcards
What is sentiment analysis?
What is sentiment analysis?
Signup and view all the flashcards
What is a sentiment dictionary?
What is a sentiment dictionary?
Signup and view all the flashcards
What does VADER consider?
What does VADER consider?
Signup and view all the flashcards
Categorically sentiment analysis
Categorically sentiment analysis
Signup and view all the flashcards
Feature Extraction
Feature Extraction
Signup and view all the flashcards
One Hot Encoding
One Hot Encoding
Signup and view all the flashcards
Bag of Words
Bag of Words
Signup and view all the flashcards
Bag Of Words Challenges
Bag Of Words Challenges
Signup and view all the flashcards
Word Embeddings
Word Embeddings
Signup and view all the flashcards
Word2Vec
Word2Vec
Signup and view all the flashcards
Cosine Similarity
Cosine Similarity
Signup and view all the flashcards
Regression Algorithm
Regression Algorithm
Signup and view all the flashcards
NRC Valence, Arousal and Dominance (VAD) Lexicon
NRC Valence, Arousal and Dominance (VAD) Lexicon
Signup and view all the flashcards
Study Notes
Week 1 Recap: What it means to be a human
- Being human involves having feelings and emotions.
- It's about the freedom to choose and experiencing new things.
- Humans conquer challenges and strive to make a positive impact.
- Being human includes self-awareness, empathy, and the ability to survive.
- Consciousness, forming relationships, and the ability to think critically are inherent.
- Humans can analyze, connect, and find meaning.
- Experiencing life through chapters, having a soul, and studying the world are factors.
- Humans can be irrational, make mistakes, and learn from them.
- Morality, ethics, and the ability to care for others, are key indicators.
Objectives
- Recognizing the science and role of emotions in humanity
- Describing how emotions are represented
- Learning to do automated sentiment analysis
Emotions and Brain Function
- Emotions come from chemical messages released in the brain.
- Stress hormones like adrenalin and cortisol are released when a potential threat is detected.
- A rewarding experience leads to the release of dopamine, oxytocin, or serotonin.
- The feeling region of the brain activates before the thinking part in certain situations.
- Emotions can be consciously managed.
Representation of Emotions
- Emotions are represented in 2 ways: Categorical and Dimensional
Paul Ekman and Categorical Representation
- Paul Ekman is a pioneer in studying emotions and facial expressions.
- He was a highly cited psychologist in the 20th century.
- He identified seven universal facial expressions of emotions.
- Anger
- Contempt
- Disgust
- Enjoyment
- Fear
- Sadness
- Surprise
- Ekman's work has greatly influenced psychology, sociology, and neuroscience.
Dimensional Representations of Emotions
- A VAD (Valence, Arousal, Dominance) model proposes emotions can be represented using a a spectrum.
- Valence shows how positive or negative an emotion is
- Arousal explains the intensity of the emotion
- Dominance relates to the level of control
- These can be mapped on a scale
Euclidean Distance
- Vectors allows for similarity calculations between emotions
- Formula: √(x1- xo)² + (y1 - yo)² + (z1 – 20)²
- Examples of euclidean distance scores between specific emotions:
- Anger and Disgust:.29
- Anger and Joy: 1.34
- Surprise and Joy: .74
Self-Assessment Manikin (SAM)
- The Self-Assessment Manikin (SAM) is used to assess the VAD (Valence, Arousal, Dominance) of an individual
Sentiment Analysis
- Sentiment analysis involves using automated processes to analyze text data for classification as negative, positive, or neutral.
- Sentiment analysis uses natural language processing, text analysis, computational linguistics, and biometrics to identify, extract, quantify, and study affective states and subjective information.
Sentiment Analysis Approaches
- The main approached for sentiment analysis are:
- Dictionary based approach
- AI or Machine Learning approach
Dictionary-Based Sentiment Analysis
- Involves using lists of words and polarity scores associated with different sentiments or emotions.
- Overall sentiment is based on an aggregate score from a text.
- The formula for overall sentiment is: (positive words - negative words) / (total words).
VADER
- VADER is a dictionary-based sentiment analysis tool
- Constructed using human ratings from Amazon Mechanical Turk and averaging their ratings
- Suited for sentiments in social media, emojis and abbreviations
- It considers the following:
- Punctuation
- Capitalization
- Degree modifiers
- Conjunctions
- Preceding Tri-gram
Dictionary-Based Categorical Approach
- It can be applied categorically, using emotions like joy, enjoyment, or fear.
Dictionary-Based Dimensional Approach
- It can be applied dimensionally, using VAD (Valence, Arousal, Dominance)
NRC Emotion Intensity Lexicon
- Categorical sentiment analysis based on eight emotions; anger, anticipation, disgust, fear, joy, sadness, surprise, and trust
- Words are assigned with each basic emotion and an intensity is scored
NRC Valence, Arousal and Dominance Lexicon
- Assigns valence, arousal and dominance scores to english words
AI / Machine Learning Approach to Sentiment Analysis
- There are no prebuilt dictionaries
- The algorithm looks for text examples, understands their labels, and learns.
- Enables detection of emotions and rules from learning.
- Steps:
- Provide Text
- Feature extraction
- Machine Learning
- Train Model
- Determine Sentiment Tag
Feature Extraction: One Hot Encoding
- Text is converted from into vectors
Feature Extraction: Bag of Words (BoW)
- It uses love, sweet, the dialogue, etc to determine the score
- Machine learning with Bow is affected ways with more or less memory, and challenging traditional algorithms
- Helps address by: stop or misspelt words, and stem algorithms
Feature Extraction : Word Embeddings
- Aims to reduce the number of dimensions to represent the vocabulary and embed words in a different space.
- It uses terms and dimensions from words such as Man, Woman, Boy, and Girl to determine a score
Word Embeddings Advantages
- Increases vocab without increasing the dimensionality
- Has word similarities like "Girl - Princess"
- Captures relationships captured eg from [king] – [man] + [woman]
Word Embeddings Training Algorithms
- Models such as Word2Vec uses machine learning where 'company' words are used to predict a target word within a local context.
Word Embedding (Cosine similarity)
- Vectors are measured by similarity by multidimensional space
- Equations involved are: idential vectors and completely different vectors
- This is a common technique in finding texts with similar semantic meanings
Regression
- A baseline supervised machine learning algorithm for classification
- Classification output from the training data
- Single input, x is represented by a vector of features
- Vector indicates importance in the output
- Formula: y = f(w•x) + b
- y means output
- f means trained weights
- w means "trained weights"
- x means input
- b means the bias
- A sigmoid function takes a real value and maps it to the range of sentiment.
Language
- Semantics (meaning) is involved with words
- Morphology (changing shape or form of words)
- Syntax (Structure/Order of sentences)
- Context (Caveman survived because they had no pressure to study)
Language Models in NLP
- Language models are statistical tools to predict the next word(s) in a sequence
- Paper entitled "Attention Is All You Need"
- Overcomes two limitations from previous previous a. Weak in long-range dependencies b.Sequential models
- This is a self-supervised learning process
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.