Podcast
Questions and Answers
What does deceptive opinion spam intentionally aim to do?
What does deceptive opinion spam intentionally aim to do?
- Promote unrelated websites
- Be easily ignored by the user
- Contain advertisements
- Sound authentic to deceive readers (correct)
What is one example of something that opinion spam can include?
What is one example of something that opinion spam can include?
- In-depth analysis of a product
- Constructive criticism of a service
- Detailed personal experiences
- Self-promotion of an unrelated website (correct)
What type of opinion spam has previous work in the area focused on?
What type of opinion spam has previous work in the area focused on?
- Deceptive opinion spam
- Negative reviews
- Disruptive opinion spam (correct)
- Authentic opinions
What is the main characteristic of deceptive opinion spam?
What is the main characteristic of deceptive opinion spam?
What makes the risk of disruptive opinion spam 'minimal' for users?
What makes the risk of disruptive opinion spam 'minimal' for users?
What is the purpose of the gold-standard deceptive opinion spam dataset created in the study?
What is the purpose of the gold-standard deceptive opinion spam dataset created in the study?
What is the potential negative consequence of opinion spam?
What is the potential negative consequence of opinion spam?
What is the purpose of websites containing consumer reviews?
What is the purpose of websites containing consumer reviews?
What is one approach the study uses to detect deceptive opinion spam?
What is one approach the study uses to detect deceptive opinion spam?
What kind of opinions does the study focus on?
What kind of opinions does the study focus on?
Flashcards
Deceptive Opinion Spam
Deceptive Opinion Spam
Fictitious opinions deliberately written to sound authentic and deceive the reader.
Opinion Spam
Opinion Spam
Inappropriate or fraudulent reviews, ranging from self-promotion to review fraud.
Disruptive Opinion Spam
Disruptive Opinion Spam
Uncontroversial spam easily identified, such as advertisements or irrelevant text.
Mechanical Turk (AMT)
Mechanical Turk (AMT)
Signup and view all the flashcards
Truth-Bias
Truth-Bias
Signup and view all the flashcards
Cross-Validation (CV)
Cross-Validation (CV)
Signup and view all the flashcards
LIWC (Linguistic Inquiry and Word Count)
LIWC (Linguistic Inquiry and Word Count)
Signup and view all the flashcards
Machine Classifiers
Machine Classifiers
Signup and view all the flashcards
Naive Bayes (NB)
Naive Bayes (NB)
Signup and view all the flashcards
Support Vector Machine (SVM)
Support Vector Machine (SVM)
Signup and view all the flashcards
Study Notes
- Explores deceptive opinion spam, which are fictitious opinions deliberately written to sound authentic.
- Integrates psychology and computational linguistics to detect opinion spam.
- Develops a classifier with nearly 90% accuracy on the gold-standard opinion spam dataset.
- Reveals a relationship between deceptive opinions and imaginative writing through feature analysis.
Introduction
- Review websites with user-generated opinions have increasing potential for monetary gain through opinion spam.
- Opinion spam can range from self-promotion to deliberate review fraud.
- Focuses on deceptive opinion spam, which are fictitious opinions written to deceive the reader.
Related Work
- Focuses on opinion spam detection and identifies the challenges around this task.
- Opinion spam is both widespread and different in nature from either e-mail or Web spam.
- Aims to find deceptive opinion spam detection beyond what humans can easily recognize.
Dataset Construction and Human Performance
- Details the process of gathering and validating deceptive and truthful hotel reviews.
- Utilizes Amazon Mechanical Turk (AMT) to solicit deceptive opinions.
- Pays Turkers one US dollar for an accepted submission, asking them to write a fake positive review for a hotel.
- Mines all 5-star truthful reviews from the 20 most popular hotels on TripAdvisor in the Chicago area.
- Assesses human deception detection performance and determines that human judges are not effective at this task.
Automated Approaches to Deceptive Opinion Spam Detection
- Explores three automated approaches to detect deceptive opinion spam: genre identification, psycholinguistic deception detection, and text categorization.
- Assesses whether truthful and deceptive reviews have a relationship with part-of-speech (POS) tags in text
- Uses Linguistic Inquiry and Word Count (LIWC) in aim to detect personality.
- Uses three n-gram feature sets, like unigrams, bigrams, and trigrams when text categorizing and contrasting detection of different texts.
Classifiers
- Uses Naive Bayes and Support Vector Machine classifiers.
- Employs the SRI Language Modeling Toolkit (Stolcke, 2002) to estimate language models for truthful and deceptive opinions.
Results and Discussion
- Evaluates deception detection, discovering automated classifiers outperform human judges in nearly every metric.
- Human judges use unreliable cues for deception, as supported by one study of online dating.
- Psycholinguistic approach performs more effectively than the genre identification method.
- Suggests that truthful opinions use concrete language and focuses on elements external to the hotel.
- Identifies a plausible relationship between deceptive opinion spam and imaginative writing linked to POS distributional similarities.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.