Podcast
Questions and Answers
Dataset Card for [Dataset Name]
Dataset Card for [Dataset Name]
SST2
The Stanford Sentiment Treebank is a corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of ______ in language.
The Stanford Sentiment Treebank is a corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of ______ in language.
sentiment
The corpus is based on the dataset introduced by ______ (2005) and consists of 11,855 single sentences extracted from movie reviews.
The corpus is based on the dataset introduced by ______ (2005) and consists of 11,855 single sentences extracted from movie reviews.
Pang and Lee
It was parsed with the Stanford parser and includes a total of ______ unique phrases from those parse trees, each annotated by 3 human judges.
It was parsed with the Stanford parser and includes a total of ______ unique phrases from those parse trees, each annotated by 3 human judges.
Signup and view all the answers
Binary classification experiments on full sentences (negative or somewhat negative vs somewhat positive or positive with neutral sentences discarded) refer to the dataset as ______ or SST binary.
Binary classification experiments on full sentences (negative or somewhat negative vs somewhat positive or positive with neutral sentences discarded) refer to the dataset as ______ or SST binary.
Signup and view all the answers
Study Notes
Stanford Sentiment Treebank Dataset
- A corpus with fully labeled parse trees, allowing for complete analysis of compositional effects in language.
- Based on the dataset introduced by Rottenberg et al. (2005).
- Consists of 11,855 single sentences extracted from movie reviews.
- Parsed with the Stanford parser, including 215,154 unique phrases from those parse trees.
- Each phrase annotated by 3 human judges.
Binary Classification Experiments
- Full sentences categorized as negative or somewhat negative vs somewhat positive or positive.
- Neutral sentences discarded in the experiment.
- This dataset is referred to as SST binary.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Description
Test your knowledge on the Stanford Sentiment Treebank (SST2) dataset and its features with this quiz. Learn about the fully labeled parse trees and the compositional effects of sentiment in language.