Corpora in Linguistics
16 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the main characteristic of a general corpus?

  • It is limited to a specific time frame.
  • It is designed to investigate a particular type of language.
  • It includes texts from different languages.
  • It represents language in its broadest sense. (correct)
  • What is the purpose of a reference corpus?

  • To trace the development of a language over time.
  • To produce reference materials for language learning. (correct)
  • To analyze the language of a specific social setting.
  • To investigate a particular type of language.
  • What is the main difference between a general corpus and a specialized corpus?

  • The type of texts included. (correct)
  • The size of the corpus.
  • The language of the texts.
  • The time frame of the texts.
  • What is the purpose of a diachronic corpus?

    <p>To trace the development of a language over time.</p> Signup and view all the answers

    What is the name of the corpus that includes informal registers of British English?

    <p>CANCODE</p> Signup and view all the answers

    How many words does the BNC corpus contain?

    <p>100 million words</p> Signup and view all the answers

    What is the name of the corpus that includes spoken registers in a US academic setting?

    <p>MICASE</p> Signup and view all the answers

    What is the main characteristic of a specialized corpus?

    <p>It is designed with more specific research goals in mind.</p> Signup and view all the answers

    What is the primary purpose of Learner's Corpora?

    <p>To identify differences among learners and native speakers</p> Signup and view all the answers

    What is the characteristic of Comparable Corpora?

    <p>They are designed along the same lines with the same proportions of texts</p> Signup and view all the answers

    What is the primary purpose of Parallel Corpora?

    <p>To find potential equivalent expressions in each language</p> Signup and view all the answers

    What is the name of the Corpus that contains 1.5 million words?

    <p>Helsinki Corpus</p> Signup and view all the answers

    What is the primary purpose of Regional Corpora?

    <p>To represent a regional variety of a language</p> Signup and view all the answers

    What is the characteristic of the International Corpus of English (ICE)?

    <p>It is a comparable corpus of different varieties of English</p> Signup and view all the answers

    What is the size of the International Corpus of Learner English (ICLE)?

    <p>20,000 words</p> Signup and view all the answers

    What can Comparable Corpora be used for?

    <p>To compare different varieties of the same language</p> Signup and view all the answers

    Study Notes

    Corpora

    • A corpus is a large, structured set of electronically stored texts used for statistical analysis, hypothesis testing, and linguistic rule validation.

    Types of Corpora

    • General Corpora: Representative of language in its broadest sense, serving as a baseline for comparative studies of general linguistic features.
      • Examples: BROWN Corpus (1 million words), LOB Corpus (1 million words), BNC (British National Corpus, 100 million words)
    • Specialized Corpora: Designed with specific research goals in mind, focusing on a particular type of text, register, or language variety.
      • Examples: CANCODE (Cambridge and Nottingham Corpus of Discourse in English, 5 million words), MICASE (Michigan Corpus of Academic Spoken English, 5 million words)
    • Historical or Diachronic Corpora: Texts from different time periods, aiming to represent a stage or stages of a language's development.
      • Example: Helsinki Corpus (700-1700 texts, 1.5 million words)
    • Regional Corpora: Representative of a regional variety of a language, such as dialects.
    • Learner's Corpora: Representative of language produced by learners, including spoken and written language samples from non-native speakers.
      • Examples: Louvain English Essays (LOCNEE) Corpus, International Corpus of Learner English (ICLE, 20,000 words)
    • Comparable Corpora: Two or more corpora in different languages or varieties of a language, designed along the same lines.
      • Example: International Corpus of English (ICE, 1 million words each of different varieties of English)
    • Parallel Corpora: Two or more corpora in different languages, each containing translated texts or simultaneously produced texts in multiple languages.
      • Used by translators and learners to identify equivalent expressions and investigate linguistic differences.

    Studying That Suits You

    Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

    Quiz Team

    Description

    This quiz covers the basics of corpora in linguistics, including types of corpora and their applications.

    More Like This

    Introduction to Corpus Linguistics Quiz
    10 questions
    Annotated Corpus Utility
    5 questions

    Annotated Corpus Utility

    EfficientForethought avatar
    EfficientForethought
    Corpus Linguistics for Translators Quiz
    16 questions
    Modern Corpus Linguistics Overview
    6 questions
    Use Quizgecko on...
    Browser
    Browser