INTD 161 Pt 3

Podcast

Listen to an AI-generated conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

How can AI encode bias, as exemplified by the Dall-E image generation model?

By reinforcing stereotypes present in the training data. (correct)
By randomly generating images without any specific pattern.
By prioritizing creative and imaginative content over realistic portrayals.
By accurately reflecting the diversity present in society.

What is the primary distinction between AI and ML based on the lecture?

ML involves systems with goals that make decisions, while AI focuses on data extraction.
ML extracts knowledge from data to build models, while AI encompasses any system that mimics intelligence. (correct)
AI is a subset of ML focused on complex problem-solving, whereas ML is a broader field.
AI extracts knowledge from data to build models, while ML mimics intelligence directly.

Which of the following best describes the role of 'labels' in machine learning?

They outline the ethical guidelines for the model’s application.
They are used to define the algorithms to be used in the model.
They provide a description of the data collection process.
They implicitly tell the model what question to answer and what the correct answers are. (correct)

What is the role of data in machine learning models, according to the lecture?

To train models and distill information and knowledge. (B)

Signup and view all the answers

An ML model is presented with an image of an animal and outputs 'dog'. What type of ML model is it, and what kind of label does it use?

Classification model; category label. (D)

Signup and view all the answers

In the context of machine learning, what does it mean for a model to 'generalize' well?

The model can make accurate predictions on new, unseen data. (C)

Signup and view all the answers

Which statement accurately describes the utilization of decision trees in machine learning?

They offer a visually intuitive representation of decision-making processes. (C)

Signup and view all the answers

What is a primary advantage of using decision trees in machine learning?

Their great explainability. (A)

Signup and view all the answers

What does the 'Bag of Words' representation primarily aim to achieve in natural language processing?

Converting text into a numerical format suitable for machine learning models. (B)

Signup and view all the answers

In the context of count-based models in NLP, what does the term 'near' typically refer to when incrementing the count in each cell?

Words that appear in close proximity to each other, like within the same sentence or document. (B)

Signup and view all the answers

What is the advantage of updating ML models versus expert systems when inaccuracies are found?

ML models can be updated more easily, allowing for quicker correction of inaccuracies. (C)

Signup and view all the answers

Which of the following is a task that can be approached using supervised learning?

Developing a model that predicts housing prices based on features of the property. (A)

Signup and view all the answers

How do Large Language Models (LLMs) predict the next word in a sentence?

By analyzing a context window of words to predict the subsequent word. (D)

Signup and view all the answers

In the context of supervised learning, what is the source of the data's labels used to train the models?

Provided by human experts or annotators. (A)

Signup and view all the answers

Why is it important to be careful about bias in data used for AI systems?

AI systems can encode and perpetuate biases present in the data, leading to unfair or discriminatory outcomes. (D)

Signup and view all the answers

Which of the following tasks is best best described as AI but not ML?

Expert system MYCIN. (C)

Signup and view all the answers

Which of the following tasks is best described as both AI and ML?

Spam filter. (D)

Signup and view all the answers

Which of the following best describes the relationship between data and ML model?

Data is the first building block to ML models to extract information. (C)

Signup and view all the answers

How do you input a picture into an ML model and what type of model does it use?

Pictures are inputed as numbers into ML model. (A)

Signup and view all the answers

What is the purpose of labels in ML model?

Labels is for the model to know what question we want to be answered and that the right answers are. (D)

Signup and view all the answers

What is the difference(s) between Classification and Regression?

Classification is the output of category output but Regression is the output of continues output. (A)

Signup and view all the answers

What is unsupervised learning?

It focuses on scenarios when the model does not learn from labeled data. (C)

Signup and view all the answers

What did LLMs learn?

That "the and animal" are import words in a sentence. (A)

Signup and view all the answers

According to the video, what is one of the most basic paradigms in ML?

Supervised learning. (C)

Signup and view all the answers

ML models are better at the examples we give it, what is the name of it?

data. (D)

Signup and view all the answers

Decision trees are what type of machine to use to separate the data?

Observations, given precipitations, clothes, and other environment labels. (B)

Signup and view all the answers

What will happen if there is more data to create ML models to do a prediction?

ML models provide generalize to new data. (D)

Signup and view all the answers

What happened if the temperature is -5, with Snow, on Wednesday wearing casual, what will the label show?

The label should show Bus because Wednesday does to generalize the temp and precipitations. (A)

Signup and view all the answers

What are decision trees?

Decision tree is a type of machine learning algorithms. (B)

Signup and view all the answers

Which of the following best describes the AI system MYCIN?

It is not an ML model but an expert rule-based system. (C)

Signup and view all the answers

What is needed when the AI system is inaccurate?

We can update the knowledge in ML models. (D)

Signup and view all the answers

The weather man forecast the weather as raining with 100 percentage and thunderstorm for Tuesday at 10:00AM, what are the category and values?

Category(Rainy for thunderstorm) and values(percentage of temp, humidity). (C)

Signup and view all the answers

Which statement is true regarding data and ML?

ML is always dependent to data. (A)

Signup and view all the answers

What does count-based models mean for NLP?

Conversion of each word to a bunch of numbers, based on its relations to other words. (A)

Signup and view all the answers

If you provide a cat list of pictures and a dog list of pictures. Then the ML model shows a dinosaur. What do you call that?

ML Models learn about all the questions in the photos and can answer all the results. (A)

Signup and view all the answers

What aspect of AI systems requires careful attention to prevent biased outputs?

The data they are trained on. (C)

Signup and view all the answers

What term describes the concept of data in machine learning?

Information. (B)

Signup and view all the answers

In the machine learning pipeline, what is the purpose of the 'Training' stage?

To extract info/knowledge from data to build the model (D)

Signup and view all the answers

In a machine learning context, what is the role of 'labels'?

Indicate the correct answer/output/prediction for a given input (A)

Signup and view all the answers

What is the primary objective of converting text into numerical representations in NLP?

To enable mathematical calculations for machine learning models. (B)

Signup and view all the answers

In count-based models for NLP, what does incrementing the count in a cell signify?

The co-occurrence of two words within a defined context. (A)

Signup and view all the answers

What is a key characteristic of supervised learning?

Models learn from labeled datasets to imitate provided answers. (C)

Signup and view all the answers

What is the practical implication of the statement that an ML model is an 'imitation machine'?

The model's accuracy is limited by the quality and nature of the training data. (A)

Signup and view all the answers

What is the core difference between how an expert system and a supervised learning system provide answers?

Expert systems answer based on predefined rules, while supervised learning systems attempt to imitate answers provided in training data. (D)

Signup and view all the answers

Why are ML models easier to update when inaccuracies are found, compared to expert systems?

ML models can be retrained with new data to adjust their behavior. (B)

Signup and view all the answers

What are the main components to consider when determining the type of transportation a person is using?

All the above. (B)

Signup and view all the answers

What does it mean for a machine learning model to 'generalize' well?

It can accurately make predictions on new, unseen data. (B)

Signup and view all the answers

In the context of the YouTube video recommendation system, what serves as the 'input' for the ML model?

The user's viewing history, current video, likes, dislikes, etc. (B)

Signup and view all the answers

In Google Lens, what is the primary role of machine learning?

To analyze the objects in order to characterize them. (C)

Signup and view all the answers

What best describes the role of data in ChatGPT?

It is used to train the model on language patterns and knowledge. (C)

Signup and view all the answers

Consider the statement: 'I had to take my _____ to the vet.' What concept does this illustrate in language models?

The importance of context in determining word choice. (D)

Signup and view all the answers

In Large Language Models (LLMs), what is the purpose of the 'context window'?

To focus on a subset of text for predicting the next word. (C)

Signup and view all the answers

Given the data of temperature, precipitation, day and clothes; What type of ML is used to produce the output (label)?

Supervised Learning (B)

Signup and view all the answers

How would you describe passive imitation?

Model has no agency: it learns passively (B)

Signup and view all the answers

What does it mean for ML to be great explainability?

Easy for us to understand how and why a decision tree makes a certain prediction (D)

Signup and view all the answers

When should we be cautious of bias when training data?

If AI systems can encode bias if not well taken care of! (B)

Signup and view all the answers

When is assignment #2 due?

Next Tuesday! (A)

Signup and view all the answers

What does Building blocks of ML consists of?

All the above (B)

Signup and view all the answers

Why does 3 submissions exist for assignment #2?

All the above (D)

Signup and view all the answers

Where did the labels/answers come mostly from in Supervised Learning?

People (C)

Signup and view all the answers

Which of these applications of AI is also ML?

Spam filter (D)

Signup and view all the answers

What does Data consists of?

A collection of discrete or continuous values that convey Information (B)

Signup and view all the answers

How to you input a picture into an ML model?

Convert it into numbers (B)

Signup and view all the answers

What does classify do regarding the types of news article?

Many options (C)

Signup and view all the answers

What are the amount of options when classifying a review type of postive or negative?

Two options (B)

Signup and view all the answers

What does 'Large' stand for in LLM?

Huge Neural Nets with Billions of connections & neurons (D)

Signup and view all the answers

In Count Based Models, what does 'near' mean?

All the above (B)

Signup and view all the answers

Which of this option will the statement refer to 'Cats will Escape'?

Will (A)

Signup and view all the answers

Given the temp is high, predict the label. Temp > 0

Walk (C)

Signup and view all the answers

What does Generalize mean related to ML?

Generalization is the ability to make predictions on new, unseen data (D)

Signup and view all the answers

What is a primary limitation of a perceptron?

Inability to model complex non-linear relationships. (D)

Signup and view all the answers

What distinguishes generative learning from supervised learning?

Generative learning uses labels to predict data, while supervised learning predicts labels from data. (A)

Signup and view all the answers

Which factor does not significantly influence the decision-making when choosing between different ML models?

The size of the team working on the project. (B)

Signup and view all the answers

What is the primary advantage of using generative learning with unlabeled data?

It allows for scaling to large datasets, thus reducing human effort in labeling. (B)

Signup and view all the answers

In the context of Large Language Models (LLMs), what is the purpose of the 'context window'?

To provide the model with a subset of text that dictates the next word in a sequence. (B)

Signup and view all the answers

What characteristic defines sequence generation models, such as those used for video generation?

They produce data by generating a sequence of data, building upon the previously provided data. (B)

Signup and view all the answers

In Generative Adversarial Networks (GANs), what is the role of the 'discriminator'?

To decide if a given output is real or artificial. (A)

Signup and view all the answers

Why is it important for generative models to have a large amount of data available?

To create outputs to be more like human outputs. (B)

Signup and view all the answers

How do decision trees classify?

By separating data using multiple straight line segments. (A)

Signup and view all the answers

What is the goal of the generator in Generative Adversarial Networks (GANs)?

Create training images to make the discriminator make an error. (C)

Signup and view all the answers

What best describes the modelling power of the Artificial Neural Network?

Excellent (B)

Signup and view all the answers

What could generative models help application(s) related to?

All of the above (D)

Signup and view all the answers

In the three models; Perceptron, Decision Tree, and Artificial Neural Network; Which of these models has the simplest interperability?

Perceptron (B)

Signup and view all the answers

What does a LLM (Large language Model) look at to predict the next word?

Looks at a context window (B)

Signup and view all the answers

Which action does unsupervised learning perform?

None of the above (D)

Signup and view all the answers

Why is the output of a generative learning randomized?

To create different predictions (D)

Signup and view all the answers

What is meant by needing more than lines?

The strength of ANNs has complex non-linear functions. (A)

Signup and view all the answers

With the 3 models; Perceptron, Decision Tree, and ANN. If the interpretability is poor, what model will that be?

ANN (B)

Signup and view all the answers

Generative learning commonly has...

No labels (A)

Signup and view all the answers

When would a statement like, 'This is a big deal!' be said?

If the model is designed to predict-the-next-word like an LLM. (B)

Signup and view all the answers

What is the risk of training the models recursively on data that has been generated?

The models may begin to lose training data (A)

Signup and view all the answers

What functions are ANNs used for?

Complex Non-Linear Functions (B)

Signup and view all the answers

Is it possible to turn our simple decision tree into a 2d plot?

TRUE (D)

Signup and view all the answers

What two networks are used in GAN?

Two artificial networks used to generate interesting novel training data. (A)

Signup and view all the answers

Minksky's famous book did what to perceptrons?

Killed perceptrons (D)

Signup and view all the answers

Why do we want the model to be creative with a dice?

Both A and C (C)

Signup and view all the answers

What does Generative Learning need to generate new images?

Dataset of images (D)

Signup and view all the answers

What should happen that tells us to stop?

If generator training goes well, the discriminator gets worse at telling the difference between real and fake. it starts to classify fake data as real, and its accuracy decreases (A)

Signup and view all the answers

What is the difference between supervised learning and generative learning?

All of the above (D)

Signup and view all the answers

How does Chat GPT generate?

Human-like conversation responses (B)

Signup and view all the answers

If chatGPT outputs something that is wrong, will we need to update the answers?

No, Human labels will be needed (B)

Signup and view all the answers

Which model can learn complex non-linear functions?

Artificial Neural Networks (A)

Signup and view all the answers

What is a characteristic of sequence generation models?

The ability to generate a frame at a time. (C)

Signup and view all the answers

What is the purpose of Data?

The information used to Train the model (C)

Signup and view all the answers

Is AI in creative activities?

Yes, to write stories articles (B)

Signup and view all the answers

Which of this option is right?

All of the above. (D)

Signup and view all the answers

What is a key characteristic that distinguishes generative learning from supervised learning?

Generative learning can work with unlabeled data to generate new data, while supervised learning requires labeled data to learn a mapping. (C)

Signup and view all the answers

What is the primary goal of the generator network in a Generative Adversarial Network (GAN)?

To create synthetic images that are indistinguishable from real images in the training set. (D)

Signup and view all the answers

Why is it often necessary for generative models to be trained with a large amount of data?

To capture the underlying patterns and distribution of the data, enabling the generation of realistic and diverse outputs. (D)

Signup and view all the answers

In the context of Large Language Models (LLMs), like ChatGPT, what is the role of the 'context window'?

To serve as the input the LLM looks at to predict the next word in a sequence. (C)

Signup and view all the answers

What is the main purpose of the discriminator in a Generative Adversarial Network (GAN)?

To distinguish between real and fake data samples. (A)

Signup and view all the answers

What is a key limitation of a perceptron in machine learning?

Its inability to learn non-linear relationships. (D)

Signup and view all the answers

What is meant by needing more than lines in machine learning?

Employing complex models for non-linear relationships. (A)

Signup and view all the answers

What is the key factor that sets generative learning apart, making it scalable and useful?

Its ability to function effectively with large amounts of unlabeled data, such as for next word prediction. (A)

Signup and view all the answers

What prompted skepticism and a subsequent decline in the popularity of research on perceptrons in the late 1960s?

Discovery of their inability to solve problems with non-linear relationships, as highlighted by the XOR problem. (B)

Signup and view all the answers

In what context might the statement 'This is a big deal!' be said?

To express the significance of leveraging unlabeled data and scaling learning effectively. (D)

Signup and view all the answers

What are Artificial Neural Networks (ANNs) most well known for?

They learn complex non-linear function. (B)

Signup and view all the answers

What is the significance of the statement 'Sutton's Bitter Lesson'?

Exploiting computation and data usually leads to the best outcome. (B)

Signup and view all the answers

If the interpretability score is poor, what model will that be?

ANN. (B)

Signup and view all the answers

Can you always turn our simple decision tree into a 2d plot?

Yes, you can always turn our simple decision tree into a 2d plot. (D)

Signup and view all the answers

Why is the output of a generative learning model randomized?

To make results appear more natural and creative. (B)

Signup and view all the answers

If it wasn't possible to separate XOR function, then what happened?

First AI winter. (C)

Signup and view all the answers

What are two networks that are used in GAN?

Generator, Discriminator. (D)

Signup and view all the answers

How and why do you stop a generative learning?

Both A and D. (E)

Signup and view all the answers

Why is the Generator helpful to use?

The generator is the useful artifact to the user. (B)

Signup and view all the answers

If a Cat picture is given to the model and cat picture is generated, what is this?

Generative Learning. (C)

Signup and view all the answers

A pathology foundation model is what type of example?

AI diagnostic in health care. (D)

Signup and view all the answers

ChatGPT prompt: Tell me a joke that I can use in a class that teaches Al to the public. ChatGPT generates a joke. What is it?

Why was the computer cold? (A)

Signup and view all the answers

What is an accurate description regarding the model's output in generative learning systems?

Often randomized to foster creativity. (B)

Signup and view all the answers

What action does unsupervised learning perform on the training dataset?

None of the above. (D)

Signup and view all the answers

What does a Large Language Model look at to predict the next word?

It looks at a context window. (B)

Signup and view all the answers

What should be considered regarding the topic of Recursive Training?

Can be potentially biased. (B)

Signup and view all the answers

When considering bias with generative AI, what action should a user take?

Be aware of the bias generative AI. (C)

Signup and view all the answers

Flashcards

What is AI?

Anything that mimics intelligence

What is Machine Learning (ML)?

A subset of AI focusing on extracting knowledge from data to build models for predictions

What is bias in ML?

ML models may encode societal biases if not carefully addressed.

What is Data?

Values that convey information, meaning, or statistics.

Signup and view all the flashcards

What is ML Training?

Extracting info/knowledge from data sets to build a predictive model.

Signup and view all the flashcards

What is ML Prediction?

Using a model to provide an answer to a query.

Signup and view all the flashcards

ML labels

Implicitly tell the model what question we want it to answer and what the right answers are.

Signup and view all the flashcards

ML Classification

The output of the model is a category of the input.

Signup and view all the flashcards

ML Regression

The output of the model is a continuous numerical value.

Signup and view all the flashcards

ML sequence

A sequence of words as output of a model.

Signup and view all the flashcards

Bag of Words

Converting text to numbers so models can do calculations.

Signup and view all the flashcards

Count Based Models

Encoding words based on relations.

Signup and view all the flashcards

Generalization

The ability for a model to make predictions on new unseen data.

Signup and view all the flashcards

Decision Trees

Classic ML models easy to understand.

Signup and view all the flashcards

Supervised Learning

Models learn from data and labeled outputs.

Signup and view all the flashcards

ML Data

A collection of numbers or text used to train machine learning models.

Signup and view all the flashcards

ML Model

A computer program that processes input and generates output, typically numerical.

Signup and view all the flashcards

ML Strengths and Weaknesses

How well a model learns is based on strengths and weaknesses.

Signup and view all the flashcards

Modelling Power

Ability of a model to discern complex patterns in data.

Signup and view all the flashcards

Data Requirements

Amount of data required to effectively train a model.

Signup and view all the flashcards

Trainability

Difficulty of training a model and expertise required

Signup and view all the flashcards

Interpretability

How easily we can understand a model and what it learns.

Signup and view all the flashcards

Linear Separability

Categories can be distinguished with one line.

Signup and view all the flashcards

Perceptrons (1969)

An influential book discussing decision making limitations.

Signup and view all the flashcards

Generative Learning

A model that generates new data that is similar to the training data.

Signup and view all the flashcards

Choosing a Model

Models are complex and have parameters with a cost function and hyperparameters.

Signup and view all the flashcards

Dall-E 3

Generative models that generate images from text prompts.

Signup and view all the flashcards

Context Window

A subset of a text source used by LLMs to predict the next word.

Signup and view all the flashcards

Generative Learning Benefit

Models that can learn from labeled and unlabeled data.

Signup and view all the flashcards

GANs (Generative Adversarial Networks)

Frameworks that combine generative and supervised learning.

Signup and view all the flashcards

Sequence generation

Model predicts the next thing that will happen in a sequence.

Signup and view all the flashcards

Study Notes

AI systems can encode biases if not taken care of.
Dall-E, an AI image generation model, produces images reflecting stereotypes of successful people as white, male, young, dressed in Western business attire, working in urban offices, and having common hairstyles.

Announcements

Assignment #2 is live and due Next Tuesday, with three possible submissions.
Three submissions are allowed, to mitigate accidental submissions, internet outages, etc.
There will be no feedback provided upon submitting.
There will be no feedback provided upon submitting.
Asignment #2 is due tonight
Asignment #1 grades have been released
Asignment #3 opens tonight
Asignment #3 is due the following Tuesday and will be like A2. It will be a multiple choice quiz in Python

Prerequisites and Learning Objectives

From last lecture, learn the building blocks of machine learning, data and models.
Data is the information used to train the model, like text and images
The model is a computer program that processes input and creates output that's usually a number or a collection of numbers.
Supervised learning learns to imitate, and learns to predict the label of data, which are created by human labelers
Key learning objectives for this content include:
Listing the strengths and weaknesses of different models.
Distinguishing between generative and supervised learning
Listing examples of generative learning systems and what they are used for

Machine Learning Building Blocks

Machine learning has 2 building blocks, data and models
Supervised learning is the basic paradigm
The goal is to list and describe the building blocks of ML systems
The goal is to identify problems that can be thought of as supervised learning

AI Caution, Fear, Excitement

“Success in creating effective AI, could be the biggest event in the history of our civilization. Or the worst. We just don't know. So we cannot know if we will be infinitely helped by AI, or ignored by it and side-lined, or conceivably destroyed by it,” ~Stephen Hawking
“The rise of AI will free people up to do things that software never will—teaching, caring for patients, and supporting the elderly, for example.” ~Bill Gates
“AI is far more dangerous than nukes.” ~Elon Musk
There are three reasons why people fear AI
Cynicism is the belief that it is rational not to cooperate
Humanism/racism is systematic bias against machines, denial of their potential moral worth and personhood
Conservatism is the fear of change, fear of the other tribe
None of these fears reflect well on those who hold them- Rich Sutton (UoA)

AI Versus ML

AI mimics intelligence, it includes systems with goals that make decisions.
ML is a subset of AI where knowledge is extracted from data to build predictive models.
Expert system MYCIN is given as an example of AI without ML.
A simulated path-finding robot is another example of AI, not ML.
Spam filters, YouTube recommendations, Google Lens, and ChatGPT are examples of both AI and ML.
ML is at the core of most modern AI systems.

Deciding which models to use

All basic models have different strengths and weaknesses
Modelling Power is whether the model can learn complex patterns
Data requirements is the amount of data needed to train the model
Trainability is whether it is difficult to train, and may require ML Expertise
Interpretability is understanding what the model learns

Data as a building block

In ML, data trains models to distill information and knowledge.
Data is a collection of discrete or continuous values conveying information, quantity, quality, facts, and statistics.

Course Map

The topics in this deck will include the Machine Learning Building Blocks 2 as well as Generative AI Systems, and choose a model.

Lecture Topics

This lecture will cover how to choose a model, and another important type of paradigm in MI: Generative Learning
Generative Learning will be broken down into Imitating Data, generative learning systems (Dall-E), Generative Adversarial Networks, and Sequence Generation Models (ChatGPT)
There will also be discussion of the power of scaling with data and computation over scaling with people.

Machine Learning Pipeline

The steps are training and prediction.
Training extracts info/knowledge from data to build the model.
Prediction uses the model to answer an input query.
Labels in ML examples provide the model with the question it should answer.
The label is the correct answer/output/prediction for a given input.

YouTube Video Recommendation example

For YouTube video recommendations, input data might include user ID, current viewing video, watch history, liked videos, and disliked videos.
The output is a list of top-K recommended videos

Google Lens example

The input is an image like a 2x2 Rubik's cube,
The output provides links to online stores selling similar items via object detection

MNIST example

MNIST, a commonly used ML dataset for research, consists of handwritten digits from 0 to 9,
It's used in real-world applications to read ZIP codes in postal services and check amounts in bank accounts.

Predicting text articles and movie reviews

Language tasks include classifying news articles by topic, classifying movie reviews as positive or negative.
There is also text generation based on input text.

Language Data

Before Large Language Models, ML focused on language, labeled "Natural Language Processing" (NLP).
Large Language Models are huge Neural Nets with Billions of connections & neurons trained mostly on text data.
They then output sentences

Bag of Words

The "Bag of Words" representation converts text to numbers for processing in ML models.
The goal is to provide calculations
Word order and importance can matter in language data.
"Near" can have many word associations

Count-Based Models and Vector Creation:

Count-based models convert words to numbers based on relationships, building a data table with rows and columns for each word.
We increment counts based on observed proximity, creating a vector representing word relationships.
This list of numbers creates a number vector.
Count-based models can show how word meanings shift over time.

Machine Learning Models

The model output is a number or a collection of numbers.
Classification models output a category, or a discrete value.
Regression models output some value that is a continuous value of the input.
Models answer questions based on input data

Classification Model

In classification, the output is a category, or a discrete label.
Examples include digit recognition (0-9), object detection (cube), and sentiment analysis (positive/negative).

Regression Model

Regression models work give output with a continuous value of the input, such as scoring digit writing, pricing, or rating a movie.

Model Output

The output of these models can be a variety of categories and values

Chat GPT

In ChatGPT data refers to the information used to train the model, like large collections of text from the internet, books, and other written sources

Machine Translation

Machine translation involves generating text output based on a given input.

Large Language models

LLMs convert large amounts of text into manageable word vectors and use context windows to predict the next word.

Attention

LLMs learn to focus on key words through a process, "attention", improving understanding.

Supervised Learning

Supervised learning, an ML framework, learns from labeled data where each input pairs with a label or correct answer.
The labels provide descriptive tags or values.
Supervised learning takes direction from the people selecting the answers

Regression

Labels in regression are numerical values, like scores, prices, or ratings.
Supervised learning models have no agency and cannot be better than the data they are given.

Imitation

Supervised learning systems and expert systems are imitation machines.
ML Models are much easier to update.

Features

Features, like temperature, precipitation, day of the week, and clothing, are numbers or categorical data inputted into the model.
Models should generalize predictions on new data.

Generalization

Generalization makes predictions on new unseen data

Decision Trees

To get around this, decision trees, a classical machine learning model, can be implemented to find trends in data
In this model temperature is noted against the data.
They can be used for both classification and regression.

Decision Trees Pros and Cons

Decision trees are noted as having high explainability but struggle against higher, more complex data implementation and learning
It is important to remember how and when the systems work for this, and those of you in INT-D 161 will use powerful ML models

Other Learning

Supervised learning is the most basic paradigm in ML but there are other learning systems.
Other learning systems include Generative and Reinforcement Learning.

Lecture Topic Summaries

Data and models are crucial ML building blocks.
Data encoding depends on data type.
Supervised learning imitates human answers.
Decision trees are explainable classical models.

Sample Basic Models

Artificial Neural Network: The model is composed of layers of interconnected nodes, or neurons, that process and transmit information, it is comprised of an Input Layer, Hidden Layers, and an Output Layer
Decision Tree: A decision tree is a structured model that uses a series of binary decisions to classify or predict outcomes based on input features or attributes, it navigates based on conditions like temperature or weather, it then derives certain conditions
Perceptron: A perceptron is a single-layer neural network that performs binary classification by applying weights to input features and using a threshold to make a decision, it includes inputs, a Heavyside Step-function, and an output

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

INTD 161 Pt 3

Choose a study mode

Podcast

Questions and Answers

How can AI encode bias, as exemplified by the Dall-E image generation model?

What is the primary distinction between AI and ML based on the lecture?

Which of the following best describes the role of 'labels' in machine learning?

What is the role of data in machine learning models, according to the lecture?

An ML model is presented with an image of an animal and outputs 'dog'. What type of ML model is it, and what kind of label does it use?

In the context of machine learning, what does it mean for a model to 'generalize' well?

Which statement accurately describes the utilization of decision trees in machine learning?

What is a primary advantage of using decision trees in machine learning?

What does the 'Bag of Words' representation primarily aim to achieve in natural language processing?

In the context of count-based models in NLP, what does the term 'near' typically refer to when incrementing the count in each cell?

What is the advantage of updating ML models versus expert systems when inaccuracies are found?

Which of the following is a task that can be approached using supervised learning?

How do Large Language Models (LLMs) predict the next word in a sentence?

In the context of supervised learning, what is the source of the data's labels used to train the models?

Why is it important to be careful about bias in data used for AI systems?

Which of the following tasks is best best described as AI but not ML?

Which of the following tasks is best described as both AI and ML?

Which of the following best describes the relationship between data and ML model?

How do you input a picture into an ML model and what type of model does it use?

What is the purpose of labels in ML model?

What is the difference(s) between Classification and Regression?

What is unsupervised learning?

What did LLMs learn?

According to the video, what is one of the most basic paradigms in ML?

ML models are better at the examples we give it, what is the name of it?

Decision trees are what type of machine to use to separate the data?

What will happen if there is more data to create ML models to do a prediction?

What happened if the temperature is -5, with Snow, on Wednesday wearing casual, what will the label show?

What are decision trees?

Which of the following best describes the AI system MYCIN?

What is needed when the AI system is inaccurate?

The weather man forecast the weather as raining with 100 percentage and thunderstorm for Tuesday at 10:00AM, what are the category and values?

Which statement is true regarding data and ML?

What does count-based models mean for NLP?

If you provide a cat list of pictures and a dog list of pictures. Then the ML model shows a dinosaur. What do you call that?

What aspect of AI systems requires careful attention to prevent biased outputs?

What term describes the concept of data in machine learning?

In the machine learning pipeline, what is the purpose of the 'Training' stage?

In a machine learning context, what is the role of 'labels'?

What is the primary objective of converting text into numerical representations in NLP?

In count-based models for NLP, what does incrementing the count in a cell signify?

What is a key characteristic of supervised learning?

What is the practical implication of the statement that an ML model is an 'imitation machine'?

What is the core difference between how an expert system and a supervised learning system provide answers?

Why are ML models easier to update when inaccuracies are found, compared to expert systems?

What are the main components to consider when determining the type of transportation a person is using?

What does it mean for a machine learning model to 'generalize' well?

In the context of the YouTube video recommendation system, what serves as the 'input' for the ML model?

In Google Lens, what is the primary role of machine learning?

What best describes the role of data in ChatGPT?

Consider the statement: 'I had to take my _____ to the vet.' What concept does this illustrate in language models?

In Large Language Models (LLMs), what is the purpose of the 'context window'?

Given the data of temperature, precipitation, day and clothes; What type of ML is used to produce the output (label)?

How would you describe passive imitation?

What does it mean for ML to be great explainability?

When should we be cautious of bias when training data?

When is assignment #2 due?

What does Building blocks of ML consists of?

Why does 3 submissions exist for assignment #2?

Where did the labels/answers come mostly from in Supervised Learning?

Which of these applications of AI is also ML?

What does Data consists of?

How to you input a picture into an ML model?

What does classify do regarding the types of news article?

What are the amount of options when classifying a review type of postive or negative?

What does 'Large' stand for in LLM?

In Count Based Models, what does 'near' mean?

Which of this option will the statement refer to 'Cats will Escape'?

Given the temp is high, predict the label. Temp > 0

What does Generalize mean related to ML?

What is a primary limitation of a perceptron?

What distinguishes generative learning from supervised learning?

Which factor does not significantly influence the decision-making when choosing between different ML models?

What is the primary advantage of using generative learning with unlabeled data?

In the context of Large Language Models (LLMs), what is the purpose of the 'context window'?

What characteristic defines sequence generation models, such as those used for video generation?