Masked Language Model (MLM) Overview

ChivalrousSmokyQuartz avatar
ChivalrousSmokyQuartz
·
·
Download

Start Quiz

Study Flashcards

6 Questions

What is the main training objective of a Masked Language Model (MLM)?

To predict the original tokens that were masked out

How does a Masked Language Model (MLM) differ from traditional language models in terms of contextual understanding?

MLMs use both preceding and following tokens to predict masked words, unlike traditional models

What special token is commonly used to replace masked words in a Masked Language Model (MLM)?

[MASK]

How does the self-supervised nature of Masked Language Models (MLMs) relate to label generation?

MLMs generate their own labels from the input text

What type of context does a Masked Language Model (MLM) leverage to predict masked tokens?

Bidirectional or non-directional context

What type of loss function is typically used in Masked Language Models (MLMs) during training?

Cross-entropy loss function

Learn about the Masked Language Model (MLM) technique used in self-supervised learning for natural language processing (NLP), popularized by models like BERT. Understand the process of masking words in a sentence and predicting them based on context.

Make Your Own Quizzes and Flashcards

Convert your notes into interactive study material.

Get started for free
Use Quizgecko on...
Browser
Browser