Podcast
Questions and Answers
Why does Hugh Craig criticize counting individual word frequencies?
Why does Hugh Craig criticize counting individual word frequencies?
- Because words are interdependent and form phrases and collocations. (correct)
- Because it is too difficult to accurately count word frequencies by hand.
- Because counting word frequencies only works with rare words.
- Because computers cannot adequately process individual words.
What is one advantage of counting words rather than phrases or collocations?
What is one advantage of counting words rather than phrases or collocations?
- It can be done quickly and does not require deep linguistic understanding.
- It is more aligned with manual search strategies.
- It provides more accurate insights into the author's style.
- It can be computerized with well-defined algorithms. (correct)
What is a limitation of analyzing language based on single words?
What is a limitation of analyzing language based on single words?
- It doesn't allow for the use of sophisticated vocabulary.
- It leads to overly complex analyses.
- It doesn't consider the emotional connotations of individual words.
- It fails to capture the unique sequences of joined words. (correct)
What does the phrase 'one word looks for another' suggest about language?
What does the phrase 'one word looks for another' suggest about language?
Which of the following describes the ideal method for analyzing word-clustering in texts, according to the passage?
Which of the following describes the ideal method for analyzing word-clustering in texts, according to the passage?
When processing the phrase 'with mirth in funeral and with', what is the resulting edge and weight from 'with' to 'in'?
When processing the phrase 'with mirth in funeral and with', what is the resulting edge and weight from 'with' to 'in'?
After processing 'with mirth in funeral and with', what is the weight of the edge from 'with' to 'and'?
After processing 'with mirth in funeral and with', what is the weight of the edge from 'with' to 'and'?
When processing the sequence ending 'with dirge in marriage In', what happens to the weight of edge from 'with' to 'in'?
When processing the sequence ending 'with dirge in marriage In', what happens to the weight of edge from 'with' to 'in'?
What action is taken when processing the sequence 'funeral and with dirge in' in relation to the edge from 'in' to 'with'?
What action is taken when processing the sequence 'funeral and with dirge in' in relation to the edge from 'in' to 'with'?
In the sequence 'marriage In equal scale weighing', what happens to the edge from 'in' back to itself?
In the sequence 'marriage In equal scale weighing', what happens to the edge from 'in' back to itself?
Flashcards
Word Frequency Analysis
Word Frequency Analysis
Analyzing the frequency of individual words in a text, ignoring the words' relationships and how they form phrases.
Word Co-occurrence Analysis
Word Co-occurrence Analysis
A statistical method that examines the co-occurrence of words in a text. It measures how often words appear together, suggesting grammatical or semantic relationships.
Word Adjacency Analysis
Word Adjacency Analysis
The study of how words are arranged and connected in a text. It goes beyond individual word counts to examine the structure and relationships of words in phrases and sentences.
Word Adjacency Networks (WANs)
Word Adjacency Networks (WANs)
Signup and view all the flashcards
Word Clustering Analysis
Word Clustering Analysis
Signup and view all the flashcards
Edge
Edge
Signup and view all the flashcards
Edge Weight
Edge Weight
Signup and view all the flashcards
Edge Creation
Edge Creation
Signup and view all the flashcards
Target Word Analysis
Target Word Analysis
Signup and view all the flashcards
Edge Weight Increment
Edge Weight Increment
Signup and view all the flashcards
Study Notes
Attributing Authorship of Henry VI Plays by Word Adjacency
- Santiago Segarra, Mark Eisen, Gabriel Egan, and Alejandro Ribeiro are the authors of the study
- The study was published in Shakespeare Quarterly, Volume 67, Number 2, Summer 2016
- The article spans pages 232-256
- The study examines the authorship of the Henry VI plays using word adjacency
- Published by Oxford University Press
- DOI: https://doi.org/10.1353/shq.2016.0024
- Additional information about the study is available via https://muse.jhu.edu/article/643795
Methods for Authorship Attribution
- Shorthand writing systems, like those used on subway systems, utilize redundancy in language.
- Claude Shannon's information theory is applied to measure redundancy in conventional writing
- Analyzing the frequency and proximity of words (adjacency) can reveal authorial patterns.
- Word adjacency networks (WANs) track the frequency and proximity of word choice in a text.
- The method counts the frequency of individual words, or the closeness of one word to the next to create a detailed profile characteristic of the writer
- Researchers create word adjacency networks (WANs) from texts to be attributed and match them to authorial profiles.
- Similarity of these networks is measured using statistical methods.
Validation of Method
- The method's effectiveness is validated by applying it to known texts, comparing the results to established authorship attributions.
- Accuracy is measured using relative entropy, comparing authorial networks in terms of word frequency and patterns
- Example texts used include: Hamlet, Satiromastix, 2 Henry IV and The Tempest, and texts by Middleton, and Jonson.
- The method can distinguish between known author styles with high accuracy.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.