Lecture 10a Polymorphism Discovery and Genotyping
29 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which type of polymorphism involves changes in individual bases?

  • Structural variants
  • Indels
  • Copy Number Variants
  • SNPs (correct)

Structural variants include changes that are greater than 1 kb.

True (A)

What is the primary purpose of genotyping?

To determine the alleles an individual has at specific positions.

The method of determining the number of copies of specific elements in the genome is known as __________.

<p>Copy Number Variants</p> Signup and view all the answers

Which of the following methods is NOT typically used for genotyping?

<p>Gene expression analysis (A)</p> Signup and view all the answers

Match the type of polymorphism with its description:

<p>SNPs = Changes in individual bases Indels = Insertions or deletions of bases CNVs = Differences in the number of copies of a genomic element Structural variants = Changes greater than 1 kb</p> Signup and view all the answers

The addition or subtraction of less than 50 bp is classified as __________.

<p>Indels</p> Signup and view all the answers

Name one challenge associated with genotyping.

<p>Sequencing errors or data interpretation difficulties.</p> Signup and view all the answers

What is the primary goal of genotype mapping?

<p>To understand the relationship between phenotype and genotype (C)</p> Signup and view all the answers

Phased genotypes refer to the genetic material that originates from only one parent.

<p>False (B)</p> Signup and view all the answers

What is the purpose of genome imputation?

<p>To fill in the missing data in genotyping with an allele.</p> Signup and view all the answers

The process of filling in missing data with an allele during genotyping is known as _____

<p>genome imputation</p> Signup and view all the answers

Match the following software with their primary purpose in genotyping:

<p>PLINK = Genetic data analysis BEAGLE = Genotype imputation MACH = Genotype imputation GLIMPSE = Genotype imputation</p> Signup and view all the answers

Which software is primarily used for polymorphism discovery and genotyping?

<p>GATK (B)</p> Signup and view all the answers

Deep Variant is known for its accuracy in genotyping.

<p>True (A)</p> Signup and view all the answers

What are two main reasons for genotyping individuals?

<p>Understanding genetic relationships and the genotype-phenotype relationship.</p> Signup and view all the answers

The area of continued development and improvement in SNP calling is _______.

<p>software</p> Signup and view all the answers

Match the SNP calling software with its primary feature:

<p>Samtools = Determining alleles at specific positions GATK = Polymorphism discovery and genotyping Deep Variant = Using deep neural networks for accuracy</p> Signup and view all the answers

What is a challenge in polymorphism discovery?

<p>All of the above (D)</p> Signup and view all the answers

Sequencing errors have no impact on polymorphism discovery.

<p>False (B)</p> Signup and view all the answers

What step does GATK involve to improve its SNP calling accuracy?

<p>Post-read mapping for local realignments.</p> Signup and view all the answers

Which of the following methods are NOT high throughput?

<p>Taq-Man and KASP (C)</p> Signup and view all the answers

Sequencing-based genotyping typically offers consistent genotype calls at known SNP positions.

<p>False (B)</p> Signup and view all the answers

What is the term used for low coverage genome sequencing that relies heavily on imputing missing data?

<p>low-pass sequencing</p> Signup and view all the answers

The _______ sequencing method enables high-quality genotype calls for an individual by focusing on known polymorphic positions.

<p>microarray-based</p> Signup and view all the answers

Match the sequencing techniques with their descriptions:

<p>Whole genome sequencing = Complete genome at low coverage Genotyping-By-Sequencing = Simultaneous polymorphism discovery and genotyping Reduced representation sequencing = Consistent subset of the genome Targeted amplification = PCR regions amplified for sequencing</p> Signup and view all the answers

Which genotyping method separates discovery and genotyping?

<p>Microarray-based genotyping (A)</p> Signup and view all the answers

What is one disadvantage of sequencing-based genotyping compared to other methods?

<p>Inconsistent genotype calls at the same positions</p> Signup and view all the answers

The cost per individual in sequencing-based approaches is generally high compared to other methods.

<p>False (B)</p> Signup and view all the answers

Flashcards

Polymorphism

A difference in an individual's genome compared to a reference genome.

SNPs

Single nucleotide polymorphisms; changes in a single DNA base.

Indels

Insertions or deletions of one or more DNA bases.

Copy Number Variants (CNVs)

Differences in the number of copies of a specific DNA segment.

Signup and view all the flashcards

Structural Variants

Large-scale changes in the genome structure greater than 1 kb.

Signup and view all the flashcards

Genotyping

Determining the alleles of an individual at specific positions in a reference genome.

Signup and view all the flashcards

Genome sequencing-based genotyping

Using genome sequencing data to identify alleles at specific positions.

Signup and view all the flashcards

Genome Imputation

Estimating the genotypes of missing positions in a genome.

Signup and view all the flashcards

Microarray-based genotyping

A method to identify specific genetic variations (SNPs) using predefined microarrays.

Signup and view all the flashcards

PCR-based genotyping

Methods like TaqMan, KASP, and HRM that use PCR to identify specific genetic variations.

Signup and view all the flashcards

DNA Sequencing-Based Genotyping

Genotyping method that uses DNA sequencing to identify variations in a genome.

Signup and view all the flashcards

Whole genome sequencing (low-pass)

Sequencing the complete genome with limited coverage (less than 1X).

Signup and view all the flashcards

Targeted amplification

Sequencing specific areas of the genome using PCR primers.

Signup and view all the flashcards

Reduced representation sequencing

Sequencing a designed subset of the genome.

Signup and view all the flashcards

Genotyping-By-Sequencing (GBS)

A reduced representation technique for sequencing.

Signup and view all the flashcards

Polymorphism discovery

Identifying new genetic variations in a genome.

Signup and view all the flashcards

SNP calling software

Software used to identify single nucleotide polymorphisms (SNPs) in sequenced data.

Signup and view all the flashcards

Samtools

Software for determining alleles at specific DNA positions, but not for identifying new polymorphisms.

Signup and view all the flashcards

GATK

Popular software for discovering and genotyping polymorphisms, using high-quality read data and alignment comparisons across multiple individuals.

Signup and view all the flashcards

Deep Variant

Google's deep learning approach for SNP discovery and genotyping. It analyzes aligned read images for accuracy.

Signup and view all the flashcards

Sequencing errors

Errors in sequencing that can lead to incorrect SNP discovery and genotyping.

Signup and view all the flashcards

Alignment errors

Mistakes in aligning sequenced reads to the reference genome, leading to inaccuracies in SNP calling.

Signup and view all the flashcards

Genomic complexity

Challenges in SNP discovery and genotyping in complex genomes, like polyploidy, requiring more sequencing coverage.

Signup and view all the flashcards

Genotype-Phenotype relationship

The study of how genotypes (genetic variations) are related to phenotypes (observable traits).

Signup and view all the flashcards

Phased Genotypes

Genotypes where we know which chromosome copy (maternal or paternal) each allele came from.

Signup and view all the flashcards

Reference Panel

A collection of individuals with high-quality genotypes representing the diversity in the pedigree of the sample being genotyped.

Signup and view all the flashcards

Parental Haplotype Maximization

An imputation algorithm that prioritizes alleles that maximize the likelihood of parental haplotype inheritance.

Signup and view all the flashcards

Why is phasing important?

Phasing allows us to understand the parental origin of polymorphisms and how they move together during meiosis.

Signup and view all the flashcards

Study Notes

Lecture 10a - Polymorphism Discovery and Genotyping - BIO4BI3 Bioinformatics

  • Lecture covered polymorphism discovery and genotyping methods within the context of bioinformatics.
  • The workflow encompasses DNA sequencing, quality control, assembly, genome annotation, expression analysis, marker-trait associations, population analysis, genotyping, and polymorphism discovery.

Learning Objectives

  • Understand polymorphisms and their four broad types.
  • Describe genotyping and the different methods used.
  • Identify some genotyping software tools.
  • Recognize challenges associated with genotyping.
  • Explain genome imputation and its various approaches.

Types of Polymorphisms

  • A polymorphism is a difference in an individual's genome relative to a reference genome.
  • Genomes evolve through mutation, duplication, and deletion.
  • Four broad categories of polymorphisms:
    • SNPs (single nucleotide polymorphisms): Changes in single bases within a genome.
    • Indels (insertions/deletions): Insertion or deletion of 1 or more bases compared to a reference.
    • CNVs (copy number variations): Differences in the number of copies of a genetic element.
    • Structural variants: Changes larger than 1kb, including insertions, deletions, inversions, translocations, and rearrangements.

What is Genotyping

  • Genotyping is determining the alleles at specific positions in an individual's genome relative to a reference.
  • Genotyping positions are often predetermined and separate from polymorphism discovery.
  • Several methods exist for genotyping:
    • Genome sequencing-based methods (low cost per data point, whole genome sequencing).
    • Microarray-based genotyping (fixed panels of SNPs).
    • PCR-derived methods (Tag-Man, KASP, HRM - rapid, reliable, lower throughput).

DNA Sequencing-Based Genotyping

  • Advances in sequencing technology lead to more sequencing-based genotyping approaches.
  • Approaches include whole-genome sequencing (low coverage), targeted amplification, and reduced representation.
  • Exome sequencing and GBS are examples of reduced representation methods.

Genotyping-by-Sequencing

  • A detailed diagram of the genotyping-by-sequencing process was provided.
  • Steps include: Constructing restricted representation libraries (RRLs) using restriction enzymes, ligating custom barcoded adaptors, pooling samples, performing PCR amplification, and using barcodes to assign sequences and produce DNA sequence data for each sample.

Polymorphism Discovery vs. Genotyping

  • Genotyping approaches (microarrays, PCR) separate polymorphism discovery and genotyping.
  • Sequencing-based approaches (GBS, exome) perform both simultaneously.
  • Genotyping known positions provides highly consistent results, but often involves technological investment.
  • Sequencing-based approaches are typically cost-effective per individual but may not consistently produce genotypes at the same positions.

Tools for Discovery and Genotyping

  • Modern polymorphism discovery utilizes whole-genome sequencing and long-read technologies.
  • Sequencing-based genotyping relies on mapping sequencing reads to a reference genome. Key software tools like Samtools, GATK, and Deep Variant assist in calling SNPs/variants.
  • GATK pipeline details the steps involved in processing DNA.

Deep Variant

  • A deep neural network approach used for SNP/variant discovery and genotyping.
  • Claims greater accuracy than other approaches (e.g., GATK) by analyzing image representations of sequencing data.

Discovery Challenges

  • Sequencing errors impact read mapping, reducing reliability of results.
  • Alignment errors and complex genomes create challenges in accuracy.
  • Genotyping in polyploidy genomes is complicated; requires more genome coverage for sufficient statistical certainty.
  • Genome duplications introduce challenges in accurate genotype determination.

Why We Genotype

  • Understand genetic relationships among individuals.
  • Understand genetic diversity in a population.
  • Undertake genealogical research.
  • Predict phenotype from genotype through disease classification, marker-assisted backcrossing, or genomic selection.
  • Perform genome-wide association studies to identify phenotype/genotype relationships.

Phased Genotyping

  • Genomes usually contain one maternal and one paternal copy of each chromosome.
  • Phased genotyping aims to determine which copy came from which parent for specific genomic locations.
  • Phased assembly is increasingly applied toward heterozygous individuals and enables more accurate imputation.

Imputing Genotypes

  • All genotyping methods have data gaps.
  • Genome imputation fills in missing data based on a reference panel of known genotypes, improving data completeness.
  • Imputation algorithms use statistical relationships, statistical allele frequencies, and parental haplotype maximization.
  • Common imputation software includes PLINK, BEAGLE, MACH, and GLIMPSE.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

Description

This quiz covers essential concepts in polymorphism discovery and genotyping methods in bioinformatics. It explores DNA sequencing, quality control, genome annotation, and the challenges associated with genotyping. Test your understanding of polymorphisms and the tools used in this field.

More Like This

Java Polymorphism and Static vs
3 questions
Polymorphism in Physical Pharmacy Quiz
10 questions
Python Polymorphism
16 questions

Python Polymorphism

SplendidFrancium avatar
SplendidFrancium
Use Quizgecko on...
Browser
Browser