5 Questions
Which dataset is the Stanford Sentiment Treebank based on?
Rotten Tomatoes movie review dataset
What is the purpose of the Stanford Sentiment Treebank corpus?
To enable analysis of the compositional effects of sentiment in language
How many unique phrases are included in the Stanford Sentiment Treebank corpus?
215,154
What type of classification experiments refer to the dataset as SST-2 or SST binary?
Binary classification experiments on full sentences
In which language is the text in the dataset?
English (en)
Study Notes
Stanford Sentiment Treebank Corpus
- The Stanford Sentiment Treebank corpus is based on the Rotten Tomatoes dataset.
- The purpose of the corpus is to enable analysis of the compositional effects of sentiment in language, using fully labeled parse trees.
- The corpus includes 215,154 unique phrases.
- Binary classification experiments on full sentences, where the sentiment is classified as either positive or negative, refer to the dataset as SST-2 or SST binary.
- The text in the dataset is in the English language.
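The binary setup in the notes above can be illustrated with a short Python sketch. It assumes each sentence carries a sentiment score between 0 and 1, and it uses the commonly cited cutoffs of 0.4 and 0.6 to separate negative, neutral, and positive, with neutral sentences dropped. The function names, the cutoff parameters, and the sample sentences are hypothetical and exist only for illustration; they are not taken from the official dataset tooling.

```python
from typing import List, Optional, Tuple


def to_binary_label(
    score: float,
    neutral_low: float = 0.4,   # assumed upper bound of the negative range
    neutral_high: float = 0.6,  # assumed upper bound of the neutral range
) -> Optional[int]:
    """Map a sentiment score in [0, 1] to 0 (negative), 1 (positive),
    or None when the score falls in the neutral band."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be in [0, 1], got {score}")
    if score <= neutral_low:
        return 0
    if score > neutral_high:
        return 1
    return None


def binarize(examples: List[Tuple[str, float]]) -> List[Tuple[str, int]]:
    """Keep only non-neutral sentences and attach a binary label to each."""
    labeled = []
    for text, score in examples:
        label = to_binary_label(score)
        if label is not None:
            labeled.append((text, label))
    return labeled


# Made-up sentences and scores, not drawn from the corpus.
sample = [
    ("A warm, funny, engaging film.", 0.9),
    ("It is a movie about two people.", 0.5),
    ("Dull and lifeless from start to finish.", 0.1),
]
binary_sample = binarize(sample)
```

Here `binary_sample` keeps the first and third sentences, labeled positive and negative, and drops the neutral second one.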
Test your knowledge on the Stanford Sentiment Treebank dataset and its features in this quiz. Explore the fully labeled parse trees and the compositional effects of sentiment in language.