Introduction to Machine Learning

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Listen to an AI-generated conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Machine Learning algorithms can help simplify code for complex problems.

True (A)

Machine learning is exclusively useful for problems with existing solutions.

False (B)

Online learning systems learn incrementally from data.

True (A)

In reinforcement learning, the system is trained with a labeled dataset.

<p>False (B)</p>
Signup and view all the answers

In batch learning, the system is capable of learning incrementally.

<p>False (B)</p>
Signup and view all the answers

Machine Learning can only be used with a traditional approach.

<p>False (B)</p>
Signup and view all the answers

Semi-supervised learning uses only labeled data.

<p>False (B)</p>
Signup and view all the answers

The most important factor for Machine Learning is the type of algorithm.

<p>False (B)</p>
Signup and view all the answers

Machine Learning requires labelled data.

<p>False (B)</p>
Signup and view all the answers

In supervised learning, training data includes the desired solutions, called labels.

<p>True (A)</p>
Signup and view all the answers

Linear Regression is an example of an unsupervised learning algorithm.

<p>False (B)</p>
Signup and view all the answers

K-Means clustering is a supervised learning algorithm.

<p>False (B)</p>
Signup and view all the answers

A spam filter classifying emails as spam or not spam is an example of regression.

<p>False (B)</p>
Signup and view all the answers

Instance-based learning involves building an explicit model from the training data.

<p>False (B)</p>
Signup and view all the answers

Machine Learning can learn from data.

<p>True (A)</p>
Signup and view all the answers

Flashcards

What is Machine Learning?

The science of programming computers so they can learn from data.

Why use Machine Learning?

Simplifies code, performs better, and adapts to new data.

Supervised Learning

ML systems trained with human supervision.

Unsupervised Learning

ML systems trained without human supervision.

Signup and view all the flashcards

Semi-Supervised Learning

Uses partially labeled training data.

Signup and view all the flashcards

Reinforcement learning

Agent learns to select actions based on rewards/penalties.

Signup and view all the flashcards

Batch Learning

Learning using all available data and applying only what it has learned. Done offline.

Signup and view all the flashcards

Online Learning

Learning incrementally from a stream of incoming data. Each step is fast and cheap.

Signup and view all the flashcards

Instance-Based Learning

Generalizes to new instances by comparing them to learned examples.

Signup and view all the flashcards

Model-Based Learning

Builds a model from training examples and uses it to make predictions.

Signup and view all the flashcards

Classification

Supervised task of predicting categories.

Signup and view all the flashcards

Regression

Predicting a target numeric value given a set of features.

Signup and view all the flashcards

Clustering

Group data into clusters.

Signup and view all the flashcards

Anomaly Detection

Detects unusual data points.

Signup and view all the flashcards

Generalization

Most Machine Learning tasks focused on predictions using model to generalized.

Signup and view all the flashcards

Study Notes

Machine Learning

  • Machine Learning (ML) is a field where computers are programmed to learn from data
  • ML enables computers to learn without explicit programming
  • A computer program learns from experience (E) regarding a task (T) and performance measure (P)
  • Performance on T, measured by P, improves with experience E

Why Use Machine Learning?

  • ML is used instead of traditional programming for spam filters
  • Traditional spam filters require writing detection algorithms for patterns in spam emails
  • The program flags emails as spam based on detected patterns
  • These steps are repeated until the program performs to a sufficient degree
  • ML can automatically adapt to changes over time, such as new spam techniques

Uses for Machine Learning

  • ML is useful for problems that require extensive hand-tuning or long lists of rules
  • One such Machine Learning algorithm simplifies code while improving performance
  • ML is used for complex problems without good solutions using traditional approaches
  • The best Machine Learning techniques find solutions for these hard problems
  • ML is used for fluctuating environments, where a Machine Learning system adapts to new data
  • ML is used for complex problems and large amounts of data to gain insights

Types of Machine Learning Systems

  • They can be classified based on various criteria
  • ML systems can be classified based on training with human supervision
  • Supervised
  • Unsupervised
  • Semi-supervised
  • Reinforcement Learning
  • ML systems can be classified based on whether they learn incrementally
  • Online versus batch learning
  • ML systems can be classified based on how they work
  • Comparing new data points to known data points
  • Detecting patterns and building a predictive model
  • Instance-based versus model-based learning

Supervised Learning

  • ML systems are classified according to the amount and type of supervision they get during training
  • Supervised learning includes training data fed to the algorithm that includes the desired solutions
  • A typical supervised learning task is classification, such as a spam filter that learns to classify new emails
  • Another task is predicting a target numeric value based on predictors; this is called regression

Supervised Learning Algorithms

  • k-Nearest Neighbors
  • Linear Regression
  • Logistic Regression
  • Support Vector Machines (SVMs)
  • Decision Trees and Random Forests
  • Neural networks

Unsupervised Learning

  • Unsupervised Learning involves training data that is unlabeled
  • Algorithms:
  • Clustering
    • K-Means
    • DBSCAN
    • Hierarchical Cluster Analysis (HCA)
  • Anomaly detection and novelty detection
  • One-class SVM
  • Isolation Forest

Semi-Supervised Learning

  • Semi-supervised learning deals with partially labeled training data
  • It usually involves a lot of unlabeled data with a little bit of labeled data

Reinforcement Learning

  • Reinforcement Learning is a very different system
  • An agent observes the environment, selects, and performs actions
  • The agent can get rewards in return (or penalties in the form of negative rewards)
  • The agent learns by itself the best strategy (aka a policy) to get the most reward over time
  • A policy defines what action the agent should choose in a given situation

Batch and Online Learning

  • Another criterion to classify ML systems is whether or not they can learn incrementally
  • Batch learning systems can not learn incrementally
  • It is trained using all available data and is typically done offline
  • Online learning trains the system incrementally by feeding it data instances sequentially
  • Learning steps are fast and cheap, so the system can learn about new data on the fly

Instance-Based vs Model-Based Learning

  • ML systems can be categorized by how they generalize
  • This involves making predictions based on instance-based training examples
  • Instance-based learning means the system needs to generalize to previously unseen examples
  • The true goal is performing well on new instances
  • Two main approaches to generalization are instance-based learning and model-based learning

Instance-Based Learning

  • Instance-based learning entails a system learning examples by heart
  • It then generalizes to new cases by comparing them to the learned examples

Model-Based Learning

  • Build a model from a set of examples
  • Use that model to make predictions
  • This is called model based learning

Main Challenges of Machine Learning

  • Select a learning algorithm and train it on some data
  • Issues include "bad algorithm" and "bad data"
  • Insufficient Quantity of Training Data
  • Nonrepresentative Training Data
  • Poor Quality Data
  • Irrelevant Features
  • Overfitting the Training Data
  • Underfitting the Training Data
  • Stepping Back
  • Testing and Validating

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Common Algorithms in Machine Learning
24 questions
Intro to Machine Learning
20 questions

Intro to Machine Learning

SurrealThermodynamics avatar
SurrealThermodynamics
Introduction to Machine Learning
10 questions
Use Quizgecko on...
Browser
Browser