Questions and Answers
Which of the following best describes the relationship between generative AI and traditional machine learning?
- Generative AI is a type of traditional machine learning. (correct)
- Generative AI is a completely separate field from traditional machine learning.
- Generative AI and traditional machine learning are independent with no overlap.
- Traditional machine learning is a subset of generative AI.
What is the primary way large language models learn their abilities?
- By finding statistical patterns in massive datasets of human-generated content. (correct)
- By interacting with the physical world and adapting their language through experience.
- Through a process of manual annotation and fine-tuning by human experts.
- By being directly programmed with specific rules for language.
Which of these models has the largest number of parameters, according to the provided information?
- BERT
- LLaMA
- PaLM (correct)
- GPT-3
What is the term used for the text input that is passed to a large language model?
What does the 'context window' refer to in the context of large language models?
What term describes the output generated by a large language model?
What process is known as using the model to generate text?
Which architectural approach significantly enhanced the performance of natural language tasks and led to a surge in generative capability?
What is the primary function of the attention mechanism in the transformer architecture?
Where are the attention weights established in a language model?
What do attention maps help to visualize in the transformer model?
What is the role of positional encoding in the transformer model?
What is the purpose of the multiple heads in multi-headed self-attention?
After the attention weights have been applied, where are the outputs moved to next in the transformer model?
What is the role of the softmax layer in the transformer architecture?
What is 'prompt engineering' primarily focused on?
Which type of model directly processes a prompt using the decoder's layers?
What is the primary characteristic of in-context learning?
What distinguishes zero-shot inference from other prompt strategies?
What defines one-shot inference?
When is few-shot inference most beneficial, according to the text?
In an encoder-decoder model, what is the role of the encoder with respect to the prompt?
What is the primary component the decoder in an encoder-decoder model uses to generate the final output?
Which of these sequences represents a progression from the least to most examples in a prompt?
What primarily limits the max tokens parameter in a language model?
What is the function of the max new tokens parameter in a language model?
If a language model uses greedy decoding, what strategy does it employ to select the next word?
What is the main purpose of random sampling in language model output generation?
What does the top-p parameter control in a language model?
How does increasing the top-p value (closer to 1) typically affect the output of a language model?
What is the key difference between the configuration parameters of a generative model and its training parameters?
Which of these is NOT a typical way to control the output of a generative language model?
In top-p sampling, if token probabilities are: mat=0.4, floor=0.3, roof=0.15, sofa=0.1, tree=0.05, and top-p is set to 0.7, which tokens will the model consider?
What does setting top-p to 1.0 signify in the context of language model sampling?
How does lowering the top-p value generally affect the output of a language model?
What is the primary role of the temperature parameter in a language model?
How does a higher temperature parameter typically affect the generated text?
What is the main purpose of Retrieval-Augmented Generation (RAG)?
According to the content, what is an advantage of RAG, compared to fine-tuning?
Which scenario best illustrates the use of a high temperature parameter?
What is the main goal of continuous pretraining of a language model?
Which type of fine-tuning focuses on adapting a model to follow user instructions more effectively?
What is the primary difference between fine-tuning and continuous pretraining?
What type of fine-tuning is exemplified by adapting a model for financial report summarization?
What does fine-tuning primarily rely on to maximize the performance of a language model?
Which of the following describes a drawback of continuous pretraining?
What is the primary benefit of fine-tuning a large language model?
How does domain adaptation fine-tuning enhance model performance?
Flashcards
Generative AI
A subset of traditional machine learning that generates content.
Large Language Models (LLMs)
Machine learning models trained on vast amounts of text to understand and generate language.
Foundation Models
Large models like GPT-3 and BERT, serving as the base for various AI tasks.
Prompt
The input text given to a language model to initiate a task.
Context Window
The amount of text an LLM can consider for generating responses.
Completion
The output generated by a language model in response to a prompt.
Inference
The process of using a language model to generate text from a prompt.
Transformer Architecture
A model architecture improving natural language processing tasks, outperforming RNNs.
Attention Weights
Values learned that measure the importance of words in a sentence.
Attention Map
A visual representation of attention weights between words.
Encoder-Decoder
Two components of the transformer that work together.
Embedding Layer
A trainable space where each word is represented as a vector.
Multi-headed Self-attention
Multiple attention weights learned in parallel to capture different language aspects.
Logits
Output from the feed-forward network representing probability scores for tokens.
Prompt Engineering
Development and improvement of prompts used to guide model responses.
Few-shot learning
A method where multiple examples are included in a prompt to guide model output.
Zero-shot prompts
Prompts that ask a model to produce an output without prior examples.
One-shot inference
A prompt that includes a single example to guide model behavior.
In-context learning
A strategy where examples are included in the prompt to improve model performance.
Encoder-Decoder Models
Models like T5 that process input with an encoder and generate output with a decoder.
Decoder-Only Models
Models like GPT where the decoder processes both prompt and output.
Zero-shot inference
Classifying or generating outputs without providing specific examples in the prompt.
Prompt Processing
How models interpret and respond to inputs to generate outputs.
Generative Configuration
Parameters that influence a model's output during inference, different from training parameters.
Max Tokens
Sets the highest number of tokens processed in one attempt, including input and output.
Max New Tokens
Limits the number of tokens the model generates during a response.
Random Sampling
A technique where the model selects words randomly based on their probability distribution instead of always the highest.
Greedy Decoding
A method where the model always selects the word with the highest probability.
Top-P Sampling
Allows selection from a subset of next tokens, where the cumulative probability meets a threshold.
Diversity vs. Coherence
The balance between producing creative outputs versus predictable, high-probability tokens.
Top-p = 1.0
Allows all tokens to be considered without any filtering.
Top-p < 1.0
Reduces choices to make outputs more predictable by limiting token selection.
Temperature Parameter
Controls randomness and creativity in language model outputs during token selection.
Low Temperature
Sharpens the probability distribution, encouraging more predictable but less creative outputs.
High Temperature
Increases randomness, promoting more diverse and potentially nonsensical outputs.
Retrieval-Augmented Generation (RAG)
An approach that enhances language model outputs by referencing external authoritative knowledge bases.
RAG Benefits
Extends LLM capabilities without the need for retraining, optimizing responses for specific domains.
Fine-Tuning
The process of adapting a pre-trained language model to specific tasks by continuing its training.
Task-Specific Fine-Tuning
Focuses on fine-tuning a model for a specific task like summarizing financial reports.
Domain Adaptation
Adapts a model to work better in specific fields like healthcare or law.
Instruction Tuning
Improves a model's ability to follow user instructions effectively.
Continuous Pretraining
Extends a model's training by exposing it to additional relevant text data.
Supervised Learning in Fine-Tuning
Uses labeled examples to improve the language model's output for specific tasks.
Labeled Examples
Prompt-completion pairs used during fine-tuning to update a model's weights.
Self-Supervised Learning
A learning method using vast amounts of unstructured data without manual labeling.
Study Notes
Web and Text Analytics 2024-25, Week 12
- Course material covers Web and Text Analytics
- Instructor is Evangelos Kalampokis
- Website: https://kalampokishub.io
- Lab website: http://islab.uom.gr
Sam Altman's Reflections for 2024
- Altman discusses the potential of superintelligence
- Current AI products are appreciated, but focus is on future superintelligence
- Superintelligent tools can greatly accelerate scientific discovery and innovation
- Abundance and prosperity are projected to increase significantly
- The transition to superintelligence is considered a leap comparable to past innovations
- Important to act responsibly, maximizing broad benefit and empowerment
- OpenAI's path is not aligned with a typical company model given its potential
Large Language Models
- Generative AI is a subset of traditional machine learning
- Generative AI models learn patterns from massive datasets of human-generated content
- Large Language Models (LLMs) are trained on trillions of words and use significant computing power
- Various foundation models like GPT-3, BERT, T5, PaLM, Llama 3.5, and Claude 3.5 exist, with different parameter counts
LLM - Terminology
- Interaction with LLMs differs from traditional machine learning (ML) models.
- LLMs use natural language prompts, not code and APIs.
- Prompts are the text input provided to LLMs.
- The prompt's context window holds input data; its size varies by model.
- The model's text output is called the completion; the process of generating it is called inference
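- A minimal sketch of these terms in code, assuming the Hugging Face transformers library is installed and using GPT-2 purely as an illustrative model:

```python
# Minimal sketch: prompt -> inference -> completion,
# assuming the Hugging Face `transformers` library is installed.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # small illustrative model

prompt = "The transformer architecture improved natural language tasks because"
result = generator(prompt, max_new_tokens=30)  # inference: generate a completion

completion = result[0]["generated_text"]
print(completion)  # the prompt followed by the generated continuation
```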
Transformer Architecture
- The transformer architecture greatly boosts performance on natural language tasks compared to RNNs
- This architecture's strength is learning relationships among words in a sentence
- Attention weights, learned during LLM training, capture the links between words
- Attention maps visualize these connections between words
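- As a rough illustration of how attention weights are computed, here is a simplified single-head sketch in NumPy (not the exact implementation of any particular model):

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Simplified single-head attention: the weights say how much each
    token attends to every other token in the sequence."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                            # (seq_len, seq_len)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)    # softmax over each row
    return weights @ V, weights                                # weighted values, attention map

# Toy example: 4 tokens with 8-dimensional representations
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
output, attention_map = scaled_dot_product_attention(x, x, x)
print(attention_map.round(2))  # rows sum to 1; this is what attention maps visualize
```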
Encoder-Decoder
- Transformers consist of encoders and decoders
- They work together, sharing characteristics
- The embedding layer creates a vector representation for each token in a high-dimensional space.
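- A small sketch of an embedding layer as a trainable lookup table (PyTorch is used only for illustration; the vocabulary size, dimensions, and token ids are arbitrary):

```python
import torch
import torch.nn as nn

vocab_size, d_model = 10_000, 512              # arbitrary illustrative sizes
embedding = nn.Embedding(vocab_size, d_model)  # trainable lookup table

token_ids = torch.tensor([[17, 4052, 9]])      # hypothetical ids for a 3-token input
vectors = embedding(token_ids)                 # shape (1, 3, 512): one vector per token
print(vectors.shape)
```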
Tokenizer – Embedding – Positional Encoding
- Tokenizers break down input text into tokens
- Embeddings convert tokens to vectors
- Positional encoding adds each token's position in the sequence, so word-order information is preserved alongside the token's embedding.
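- One common scheme is the sinusoidal positional encoding from the original transformer paper; a simplified NumPy sketch, not tied to any specific library:

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len, d_model):
    """Encode each position as sines/cosines of different frequencies;
    the result is added to the token embeddings to preserve word order."""
    positions = np.arange(seq_len)[:, None]                  # (seq_len, 1)
    dims = np.arange(d_model)[None, :]                       # (1, d_model)
    angle_rates = 1 / np.power(10000, (2 * (dims // 2)) / d_model)
    angles = positions * angle_rates
    encoding = np.zeros((seq_len, d_model))
    encoding[:, 0::2] = np.sin(angles[:, 0::2])              # even dimensions
    encoding[:, 1::2] = np.cos(angles[:, 1::2])              # odd dimensions
    return encoding

print(sinusoidal_positional_encoding(seq_len=4, d_model=8).round(2))
```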
Multi-headed Self-Attention
- Input tokens and positional encodings are processed through a self-attention layer.
- Self-attention weights reflect the importance of each word to other words in the input sequence.
- Multiple attention heads learn different aspects of language in parallel during training
- Multi-headed self-attention is part of the transformer's architecture
Feed-Forward Network
- The outputs of the attention layer feed into a fully connected feed-forward network
- The network produces a vector (logits) of probabilities for each token, based on the input context and training data.
- A softmax layer normalizes the logits into probability scores for each token
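- A tiny sketch of this last step, turning logits into a probability distribution over a toy vocabulary (the logit values are made up for illustration):

```python
import numpy as np

vocab = ["mat", "floor", "roof", "sofa", "tree"]   # toy vocabulary
logits = np.array([2.1, 1.8, 1.1, 0.7, 0.0])       # made-up scores from the network

probs = np.exp(logits - logits.max())
probs /= probs.sum()                               # softmax: probabilities sum to 1

for token, p in zip(vocab, probs):
    print(f"{token}: {p:.2f}")
```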
Prompt Engineering
- This involves developing and improving prompts to optimize LLM outputs.
- Prompt types include instruction-based, few-shot, and zero-shot prompts
Transformer Layers and the Prompt
- Encoder-Decoder models (like T5): the encoder processes input text to create a representation, and the decoder uses that representation & its own self-attention mechanism for output.
- Decoder-Only models (like GPT): the prompt is directly processed by the decoder layers
In-context Learning
- Examples within the prompt help LLMs generate better outcomes
- A single example is called one-shot inference; multiple examples are called few-shot inference
Zero-Shot Inference
- The prompt specifies the task, the context, and the type of output desired
- This method does not utilize pre-provided examples
Zero-Shot Evaluation
- This tests the performance of models without examples
One-Shot Inference
- The prompt contains a sample input and expected output to demonstrate context
Few-Shot Inference
- Several examples are given within the prompt demonstrating the task and desired output
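- The difference between these strategies lies only in how the prompt is built; a sketch with made-up review-classification examples:

```python
# Sketch: zero-, one-, and few-shot prompts differ only in the number of
# worked examples placed before the actual input (the examples are made up).
task = "Classify the sentiment of the review as positive or negative."

zero_shot = f"{task}\nReview: The plot was dull.\nSentiment:"

one_shot = (
    f"{task}\n"
    "Review: I loved this film.\nSentiment: positive\n"
    "Review: The plot was dull.\nSentiment:"
)

few_shot = (
    f"{task}\n"
    "Review: I loved this film.\nSentiment: positive\n"
    "Review: A complete waste of time.\nSentiment: negative\n"
    "Review: The plot was dull.\nSentiment:"
)
```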
Continuous Pretraining Vs Fine-Tuning
- Continuous pretraining improves overall general language understanding through exposure to additional data
- Fine-tuning focuses on a particular task or set of tasks via labeled sample prompts and outputs.
Fine-tuning LLMs
- Fine-tuning adjusts a pre-trained model for specialized tasks and leverages initial language understanding
- Approaches include task-specific fine-tuning for specialized tasks, domain adaptation for particular domains, and instruction tuning for better instruction following.
Parameter-Efficient Fine-Tuning (PEFT)
- Parameter-efficient methods update a subset of parameters to avoid catastrophic forgetting and improve efficiency.
- Techniques may involve freezing some parts of the model and training only specific sections
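- A minimal sketch of the "freeze most weights, train a small part" idea in PyTorch; the model and layers are hypothetical, and real PEFT methods such as LoRA instead add small trainable matrices:

```python
import torch.nn as nn

# Hypothetical tiny "pretrained" model, used only to illustrate freezing.
model = nn.Sequential(
    nn.Embedding(1000, 64),   # pretend these layers are pretrained
    nn.Linear(64, 64),
    nn.Linear(64, 2),         # small head we want to adapt to the new task
)

for param in model.parameters():
    param.requires_grad = False        # freeze everything

for param in model[-1].parameters():
    param.requires_grad = True         # train only the final layer

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"Training {trainable} of {total} parameters")
```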
Max Tokens
- The max tokens setting limits the combined number of input and output tokens.
- The max tokens parameter is connected to the context window.
Max new tokens
- Max new tokens limits the number of tokens the model generates
Random Sampling
- LLMs can use random sampling in the output process to introduce variability and reduce the bias toward always picking the highest-probability word
- Random sampling is an alternative to greedy decoding
Top-P
- Top-P sampling selects tokens based on the cumulative probability; higher Top-P values will result in more diverse outputs
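- A sketch of top-p selection using the token probabilities from the quiz example above (mat=0.4, floor=0.3, roof=0.15, sofa=0.1, tree=0.05 with top-p = 0.7):

```python
import numpy as np

tokens = ["mat", "floor", "roof", "sofa", "tree"]
probs = np.array([0.4, 0.3, 0.15, 0.1, 0.05])
top_p = 0.7

order = np.argsort(probs)[::-1]               # sort tokens by probability, descending
cumulative = np.cumsum(probs[order])
keep = order[: np.searchsorted(cumulative, top_p) + 1]   # smallest set reaching top_p

kept_probs = probs[keep] / probs[keep].sum()  # renormalize over the kept tokens
choice = np.random.default_rng(0).choice([tokens[i] for i in keep], p=kept_probs)
print([tokens[i] for i in keep], "->", choice)  # ['mat', 'floor'] -> sampled token
```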
Temperature
- Temperature controls output randomness.
- Lower temperatures produce more predictable outputs, while higher temperatures allow more diverse outputs
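- A sketch of how temperature reshapes the probability distribution before sampling (the logit values are made up):

```python
import numpy as np

logits = np.array([2.1, 1.8, 1.1, 0.7, 0.0])   # made-up scores for five candidate tokens

def softmax_with_temperature(logits, temperature):
    scaled = logits / temperature               # low T sharpens, high T flattens
    exp = np.exp(scaled - scaled.max())
    return exp / exp.sum()

print(softmax_with_temperature(logits, 0.5).round(2))  # peaked -> predictable choices
print(softmax_with_temperature(logits, 1.5).round(2))  # flatter -> more diverse choices
```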
Retrieval Augmented Generation (RAG)
- RAG enhances LLM response quality by incorporating external knowledge sources.
- Reduces the need for retraining the model for specific use cases.
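- A highly simplified sketch of the RAG flow; `embed` and `generate` are hypothetical callables supplied by the caller, not a real library API:

```python
import numpy as np

def cosine_similarity(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def rag_answer(question, documents, embed, generate, k=3):
    """Retrieve the k most relevant documents and prepend them to the prompt,
    so the model can ground its answer on external knowledge."""
    query_vec = embed(question)                                # hypothetical embedding fn
    ranked = sorted(documents,
                    key=lambda doc: cosine_similarity(query_vec, embed(doc)),
                    reverse=True)
    context = "\n".join(ranked[:k])                            # top-k retrieved passages
    prompt = (
        "Use only the context below to answer.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return generate(prompt)                                    # hypothetical LLM call
```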
Fine-tuning On a Single Task
- Fine-tuning can be used on a single task in contrast to continuous pretraining
- This may cause catastrophic forgetting.
- PEFT can help mitigate this problem.
Description
Test your knowledge on the intricacies of generative AI and large language models. This quiz covers essential concepts such as the relationship between generative and traditional machine learning, the architecture of transformers, and the function of attention mechanisms. Dive into the world of AI and enhance your understanding of how these technologies operate.