Questions and Answers
Dataset Card for [Dataset Name]
SST2
The Stanford Sentiment Treebank is a corpus with fully labeled parse trees that allows for a complete analysis of the compositional effects of ______ in language.
sentiment
The corpus is based on the dataset introduced by ______ (2005) and consists of 11,855 single sentences extracted from movie reviews.
Pang and Lee
It was parsed with the Stanford parser and includes a total of ______ unique phrases from those parse trees, each annotated by 3 human judges.
215,154
Binary classification experiments on full sentences (negative or somewhat negative vs somewhat positive or positive with neutral sentences discarded) refer to the dataset as ______ or SST binary.
SST-2
Study Notes
Stanford Sentiment Treebank Dataset
- A corpus with fully labeled parse trees, allowing for a complete analysis of the compositional effects of sentiment in language.
- Based on the dataset introduced by Pang and Lee (2005).
- Consists of 11,855 single sentences extracted from movie reviews.
- Parsed with the Stanford parser, including 215,154 unique phrases from those parse trees.
- Each phrase annotated by 3 human judges.
Binary Classification Experiments
- Full sentences categorized as negative or somewhat negative vs somewhat positive or positive.
- Neutral sentences discarded in the experiment.
- This dataset is referred to as SST-2 or SST binary.
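The binary setup described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the label strings, the `to_binary` helper, and the 0/1 encoding are hypothetical, and an actual SST release may encode the five sentiment classes differently (for example as integers).

```python
from typing import Iterable, List, Tuple

# Hypothetical label names for the five sentiment classes; a real SST
# release may use a different encoding.
NEGATIVE = {"negative", "somewhat negative"}
POSITIVE = {"somewhat positive", "positive"}


def to_binary(examples: Iterable[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Map five-class sentence labels to 0 (negative) or 1 (positive).

    Neutral sentences are dropped, mirroring the binary experiments.
    """
    binary = []
    for sentence, label in examples:
        if label in NEGATIVE:
            binary.append((sentence, 0))
        elif label in POSITIVE:
            binary.append((sentence, 1))
        elif label != "neutral":
            # Anything outside the five assumed classes is an error.
            raise ValueError(f"unknown label: {label!r}")
    return binary
```

Called on a list of `(sentence, label)` pairs, the helper returns only the non-neutral sentences, each paired with its binary label.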