Podcast
Questions and Answers
Which type of polymorphism involves changes in individual bases?
Which type of polymorphism involves changes in individual bases?
- Structural variants
- Indels
- Copy Number Variants
- SNPs (correct)
Structural variants include changes that are greater than 1 kb.
Structural variants include changes that are greater than 1 kb.
True (A)
What is the primary purpose of genotyping?
What is the primary purpose of genotyping?
To determine the alleles an individual has at specific positions.
The method of determining the number of copies of specific elements in the genome is known as __________.
The method of determining the number of copies of specific elements in the genome is known as __________.
Which of the following methods is NOT typically used for genotyping?
Which of the following methods is NOT typically used for genotyping?
Match the type of polymorphism with its description:
Match the type of polymorphism with its description:
The addition or subtraction of less than 50 bp is classified as __________.
The addition or subtraction of less than 50 bp is classified as __________.
Name one challenge associated with genotyping.
Name one challenge associated with genotyping.
What is the primary goal of genotype mapping?
What is the primary goal of genotype mapping?
Phased genotypes refer to the genetic material that originates from only one parent.
Phased genotypes refer to the genetic material that originates from only one parent.
What is the purpose of genome imputation?
What is the purpose of genome imputation?
The process of filling in missing data with an allele during genotyping is known as _____
The process of filling in missing data with an allele during genotyping is known as _____
Match the following software with their primary purpose in genotyping:
Match the following software with their primary purpose in genotyping:
Which software is primarily used for polymorphism discovery and genotyping?
Which software is primarily used for polymorphism discovery and genotyping?
Deep Variant is known for its accuracy in genotyping.
Deep Variant is known for its accuracy in genotyping.
What are two main reasons for genotyping individuals?
What are two main reasons for genotyping individuals?
The area of continued development and improvement in SNP calling is _______.
The area of continued development and improvement in SNP calling is _______.
Match the SNP calling software with its primary feature:
Match the SNP calling software with its primary feature:
What is a challenge in polymorphism discovery?
What is a challenge in polymorphism discovery?
Sequencing errors have no impact on polymorphism discovery.
Sequencing errors have no impact on polymorphism discovery.
What step does GATK involve to improve its SNP calling accuracy?
What step does GATK involve to improve its SNP calling accuracy?
Which of the following methods are NOT high throughput?
Which of the following methods are NOT high throughput?
Sequencing-based genotyping typically offers consistent genotype calls at known SNP positions.
Sequencing-based genotyping typically offers consistent genotype calls at known SNP positions.
What is the term used for low coverage genome sequencing that relies heavily on imputing missing data?
What is the term used for low coverage genome sequencing that relies heavily on imputing missing data?
The _______ sequencing method enables high-quality genotype calls for an individual by focusing on known polymorphic positions.
The _______ sequencing method enables high-quality genotype calls for an individual by focusing on known polymorphic positions.
Match the sequencing techniques with their descriptions:
Match the sequencing techniques with their descriptions:
Which genotyping method separates discovery and genotyping?
Which genotyping method separates discovery and genotyping?
What is one disadvantage of sequencing-based genotyping compared to other methods?
What is one disadvantage of sequencing-based genotyping compared to other methods?
The cost per individual in sequencing-based approaches is generally high compared to other methods.
The cost per individual in sequencing-based approaches is generally high compared to other methods.
Flashcards
Polymorphism
Polymorphism
A difference in an individual's genome compared to a reference genome.
SNPs
SNPs
Single nucleotide polymorphisms; changes in a single DNA base.
Indels
Indels
Insertions or deletions of one or more DNA bases.
Copy Number Variants (CNVs)
Copy Number Variants (CNVs)
Signup and view all the flashcards
Structural Variants
Structural Variants
Signup and view all the flashcards
Genotyping
Genotyping
Signup and view all the flashcards
Genome sequencing-based genotyping
Genome sequencing-based genotyping
Signup and view all the flashcards
Genome Imputation
Genome Imputation
Signup and view all the flashcards
Microarray-based genotyping
Microarray-based genotyping
Signup and view all the flashcards
PCR-based genotyping
PCR-based genotyping
Signup and view all the flashcards
DNA Sequencing-Based Genotyping
DNA Sequencing-Based Genotyping
Signup and view all the flashcards
Whole genome sequencing (low-pass)
Whole genome sequencing (low-pass)
Signup and view all the flashcards
Targeted amplification
Targeted amplification
Signup and view all the flashcards
Reduced representation sequencing
Reduced representation sequencing
Signup and view all the flashcards
Genotyping-By-Sequencing (GBS)
Genotyping-By-Sequencing (GBS)
Signup and view all the flashcards
Polymorphism discovery
Polymorphism discovery
Signup and view all the flashcards
SNP calling software
SNP calling software
Signup and view all the flashcards
Samtools
Samtools
Signup and view all the flashcards
GATK
GATK
Signup and view all the flashcards
Deep Variant
Deep Variant
Signup and view all the flashcards
Sequencing errors
Sequencing errors
Signup and view all the flashcards
Alignment errors
Alignment errors
Signup and view all the flashcards
Genomic complexity
Genomic complexity
Signup and view all the flashcards
Genotype-Phenotype relationship
Genotype-Phenotype relationship
Signup and view all the flashcards
Phased Genotypes
Phased Genotypes
Signup and view all the flashcards
Reference Panel
Reference Panel
Signup and view all the flashcards
Parental Haplotype Maximization
Parental Haplotype Maximization
Signup and view all the flashcards
Why is phasing important?
Why is phasing important?
Signup and view all the flashcards
Study Notes
Lecture 10a - Polymorphism Discovery and Genotyping - BIO4BI3 Bioinformatics
- Lecture covered polymorphism discovery and genotyping methods within the context of bioinformatics.
- The workflow encompasses DNA sequencing, quality control, assembly, genome annotation, expression analysis, marker-trait associations, population analysis, genotyping, and polymorphism discovery.
Learning Objectives
- Understand polymorphisms and their four broad types.
- Describe genotyping and the different methods used.
- Identify some genotyping software tools.
- Recognize challenges associated with genotyping.
- Explain genome imputation and its various approaches.
Types of Polymorphisms
- A polymorphism is a difference in an individual's genome relative to a reference genome.
- Genomes evolve through mutation, duplication, and deletion.
- Four broad categories of polymorphisms:
- SNPs (single nucleotide polymorphisms): Changes in single bases within a genome.
- Indels (insertions/deletions): Insertion or deletion of 1 or more bases compared to a reference.
- CNVs (copy number variations): Differences in the number of copies of a genetic element.
- Structural variants: Changes larger than 1kb, including insertions, deletions, inversions, translocations, and rearrangements.
What is Genotyping
- Genotyping is determining the alleles at specific positions in an individual's genome relative to a reference.
- Genotyping positions are often predetermined and separate from polymorphism discovery.
- Several methods exist for genotyping:
- Genome sequencing-based methods (low cost per data point, whole genome sequencing).
- Microarray-based genotyping (fixed panels of SNPs).
- PCR-derived methods (Tag-Man, KASP, HRM - rapid, reliable, lower throughput).
DNA Sequencing-Based Genotyping
- Advances in sequencing technology lead to more sequencing-based genotyping approaches.
- Approaches include whole-genome sequencing (low coverage), targeted amplification, and reduced representation.
- Exome sequencing and GBS are examples of reduced representation methods.
Genotyping-by-Sequencing
- A detailed diagram of the genotyping-by-sequencing process was provided.
- Steps include: Constructing restricted representation libraries (RRLs) using restriction enzymes, ligating custom barcoded adaptors, pooling samples, performing PCR amplification, and using barcodes to assign sequences and produce DNA sequence data for each sample.
Polymorphism Discovery vs. Genotyping
- Genotyping approaches (microarrays, PCR) separate polymorphism discovery and genotyping.
- Sequencing-based approaches (GBS, exome) perform both simultaneously.
- Genotyping known positions provides highly consistent results, but often involves technological investment.
- Sequencing-based approaches are typically cost-effective per individual but may not consistently produce genotypes at the same positions.
Tools for Discovery and Genotyping
- Modern polymorphism discovery utilizes whole-genome sequencing and long-read technologies.
- Sequencing-based genotyping relies on mapping sequencing reads to a reference genome. Key software tools like Samtools, GATK, and Deep Variant assist in calling SNPs/variants.
- GATK pipeline details the steps involved in processing DNA.
Deep Variant
- A deep neural network approach used for SNP/variant discovery and genotyping.
- Claims greater accuracy than other approaches (e.g., GATK) by analyzing image representations of sequencing data.
Discovery Challenges
- Sequencing errors impact read mapping, reducing reliability of results.
- Alignment errors and complex genomes create challenges in accuracy.
- Genotyping in polyploidy genomes is complicated; requires more genome coverage for sufficient statistical certainty.
- Genome duplications introduce challenges in accurate genotype determination.
Why We Genotype
- Understand genetic relationships among individuals.
- Understand genetic diversity in a population.
- Undertake genealogical research.
- Predict phenotype from genotype through disease classification, marker-assisted backcrossing, or genomic selection.
- Perform genome-wide association studies to identify phenotype/genotype relationships.
Phased Genotyping
- Genomes usually contain one maternal and one paternal copy of each chromosome.
- Phased genotyping aims to determine which copy came from which parent for specific genomic locations.
- Phased assembly is increasingly applied toward heterozygous individuals and enables more accurate imputation.
Imputing Genotypes
- All genotyping methods have data gaps.
- Genome imputation fills in missing data based on a reference panel of known genotypes, improving data completeness.
- Imputation algorithms use statistical relationships, statistical allele frequencies, and parental haplotype maximization.
- Common imputation software includes PLINK, BEAGLE, MACH, and GLIMPSE.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
This quiz covers essential concepts in polymorphism discovery and genotyping methods in bioinformatics. It explores DNA sequencing, quality control, genome annotation, and the challenges associated with genotyping. Test your understanding of polymorphisms and the tools used in this field.