Tokenization

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

What is the correct tokenization for 'Finland's capital' in English?

  • Finland’s
  • Finland's capital
  • Finland (correct)
  • Finlands

What is the correct tokenization for 'state of the art' in English?

  • the art
  • state of the art (correct)
  • state
  • state of

What is the correct tokenization for 'L'ensemble' in French?

  • L'ensemble (correct)
  • L . ensemble
  • L’ . ensemble
  • Le ensemble

Which of the following is a correct tokenization for 'what're' in English?

<p>What are (D)</p> Signup and view all the answers

In French, how should 'L'ensemble' be tokenized?

<p>L'ensemble (D)</p> Signup and view all the answers

What is the correct tokenization for 'Lebensversicherungsgesellschaftsangestellter' in German?

<p>Lebensversicherungsgesellschaftsangestellter (C)</p> Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes

Tokenization

  • 'Finland's capital' in English should be tokenized as ['Finland', "'s", 'capital']
  • 'state of the art' in English should be tokenized as ['state', 'of', 'the', 'art']
  • 'L'ensemble' in French should be tokenized as ['L', "ensemble"]
  • 'what're' in English can be correctly tokenized as ['what', "'re"]
  • 'L'ensemble' in French should be tokenized as ['L', "ensemble"]
  • 'Lebensversicherungsgesellschaftsangestellter' in German should be tokenized as a single token, as it is a compound word.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

More Like This

Tokenization
9 questions

Tokenization

MercifulIsland avatar
MercifulIsland
Tokenization and Text Preprocessing Quiz
5 questions
Tokenization and Language Terminology Quiz
40 questions
Use Quizgecko on...
Browser
Browser