Podcast
Questions and Answers
In Illumina sequencing, what is the primary purpose of cleaving the P5 region after solid-phase amplification?
In Illumina sequencing, what is the primary purpose of cleaving the P5 region after solid-phase amplification?
- To allow the clusters to be sequenced in the opposite direction.
- To label the nucleotides for sequencing.
- To ensure all copies of the DNA fragments are sequenced in the same direction. (correct)
- To prevent bridge amplification from occurring.
In Illumina paired-end sequencing, the second read is sequenced from the same strand as the first read.
In Illumina paired-end sequencing, the second read is sequenced from the same strand as the first read.
False (B)
What is the term used to describe the process by which millions of identical DNA fragments are generated in unique locations on the flow cell surface during Illumina sequencing?
What is the term used to describe the process by which millions of identical DNA fragments are generated in unique locations on the flow cell surface during Illumina sequencing?
Bridge Amplification
In Illumina sequencing, clusters of DNA fragments are ______ amplified via bridge amplification.
In Illumina sequencing, clusters of DNA fragments are ______ amplified via bridge amplification.
Match the following sequencing-related terms with their descriptions:
Match the following sequencing-related terms with their descriptions:
A researcher aims to sequence all the genetic information of a newly discovered bacterium, including both coding and non-coding regions. Which NGS assay is most appropriate?
A researcher aims to sequence all the genetic information of a newly discovered bacterium, including both coding and non-coding regions. Which NGS assay is most appropriate?
Whole-Exome Sequencing (WES) targets the entire genome, including both exons and introns.
Whole-Exome Sequencing (WES) targets the entire genome, including both exons and introns.
A scientist is interested in studying gene expression patterns in a cell under different environmental conditions. Which NGS assay would be most suitable?
A scientist is interested in studying gene expression patterns in a cell under different environmental conditions. Which NGS assay would be most suitable?
In NGS assays, library preparation involves selecting for the desired pool of ______ from fragmented DNA or PCR amplicons.
In NGS assays, library preparation involves selecting for the desired pool of ______ from fragmented DNA or PCR amplicons.
Match the following NGS assays with their primary targets:
Match the following NGS assays with their primary targets:
During DNA sequencing using Thermo Fisher Ion Sequencing, what is directly detected by the sensors in the wells?
During DNA sequencing using Thermo Fisher Ion Sequencing, what is directly detected by the sensors in the wells?
In Illumina sequencing, the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand involves the release of a hydroxide ion (OH-).
In Illumina sequencing, the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand involves the release of a hydroxide ion (OH-).
What is the role of DNA polymerase in the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand?
What is the role of DNA polymerase in the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand?
In Thermo Fisher Ion Sequencing, fragments of genomic DNA are clonally amplified onto ________ beads before being delivered to individual wells.
In Thermo Fisher Ion Sequencing, fragments of genomic DNA are clonally amplified onto ________ beads before being delivered to individual wells.
Match the sequencing technology with the aspect it measures:
Match the sequencing technology with the aspect it measures:
Which of the following is added to each bead-bound template strand in Thermo Fisher Ion Sequencing?
Which of the following is added to each bead-bound template strand in Thermo Fisher Ion Sequencing?
Next-generation sequencing (NGS) assays are designed to measure only a single analyte at a time.
Next-generation sequencing (NGS) assays are designed to measure only a single analyte at a time.
What is released during the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand?
What is released during the incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand?
What is the 'insert size' in the context of fragmented DNA sequencing?
What is the 'insert size' in the context of fragmented DNA sequencing?
In paired-end sequencing, you obtain only one read per DNA fragment.
In paired-end sequencing, you obtain only one read per DNA fragment.
What benefit do paired-end reads provide when aligning sequences to a reference genome, compared to single-end reads?
What benefit do paired-end reads provide when aligning sequences to a reference genome, compared to single-end reads?
If you perform 100bp paired-end sequencing on a 220bp insert, there will be a(n) _______ of 20 bp that is not sequenced.
If you perform 100bp paired-end sequencing on a 220bp insert, there will be a(n) _______ of 20 bp that is not sequenced.
In paired-end sequencing, if your read length is longer than expected based on the insert size, what occurs?
In paired-end sequencing, if your read length is longer than expected based on the insert size, what occurs?
In WGS and WES, after aligning sequencing reads, there is typically an order on the chromosome before further analysis.
In WGS and WES, after aligning sequencing reads, there is typically an order on the chromosome before further analysis.
Match the read type to the sequencing strategy characteristic:
Match the read type to the sequencing strategy characteristic:
Why might reads containing only two words be difficult to align when reconstructing a sentence from sequence data?
Why might reads containing only two words be difficult to align when reconstructing a sentence from sequence data?
Targeted sequencing assays, such as the 395-gene panel, offer which of the following advantages compared to whole-genome sequencing?
Targeted sequencing assays, such as the 395-gene panel, offer which of the following advantages compared to whole-genome sequencing?
Next-generation sequencing (NGS) technologies are unable to identify rare variants due to limitations in coverage depth.
Next-generation sequencing (NGS) technologies are unable to identify rare variants due to limitations in coverage depth.
What is the primary purpose of adapter ligation in the library preparation stage of NGS?
What is the primary purpose of adapter ligation in the library preparation stage of NGS?
In NGS, a combination of a sample and flow cell is specified by a ______.
In NGS, a combination of a sample and flow cell is specified by a ______.
Before adapter ligation, what step is necessary to prepare DNA for NGS?
Before adapter ligation, what step is necessary to prepare DNA for NGS?
Which of the following correctly describes the process after sequencing in NGS?
Which of the following correctly describes the process after sequencing in NGS?
Match each gene with its relevance to cancer studies:
Match each gene with its relevance to cancer studies:
A key benefit of NGS is its ability to sequence only a small proportion of DNA from multiple individuals simultaneously, reducing costs.
A key benefit of NGS is its ability to sequence only a small proportion of DNA from multiple individuals simultaneously, reducing costs.
In the context of sequencing data, what does 'coverage (20x)' indicate?
In the context of sequencing data, what does 'coverage (20x)' indicate?
Variant Allele Fraction (VAF) represents the total number of reads at a site, regardless of whether they contain the variant allele.
Variant Allele Fraction (VAF) represents the total number of reads at a site, regardless of whether they contain the variant allele.
Define 'Depth of Coverage' in Next-Generation Sequencing (NGS).
Define 'Depth of Coverage' in Next-Generation Sequencing (NGS).
In sequencing, a high GC content in certain regions can introduce a ______, making them more difficult to sequence accurately.
In sequencing, a high GC content in certain regions can introduce a ______, making them more difficult to sequence accurately.
Match each sequencing term with its corresponding description:
Match each sequencing term with its corresponding description:
What is the primary goal when analyzing reads in the context of DNA sequencing?
What is the primary goal when analyzing reads in the context of DNA sequencing?
Which factor can introduce bias in Depth of Coverage (DoC) during NGS experiments?
Which factor can introduce bias in Depth of Coverage (DoC) during NGS experiments?
Explain why different exome capture kits can yield different analyzable territories.
Explain why different exome capture kits can yield different analyzable territories.
Flashcards
"De novo" sequencing
"De novo" sequencing
Sequencing without prior knowledge of the target organism's genome. Sequences everything.
"Targeted" sequencing
"Targeted" sequencing
Sequencing specific regions of a genome. Focuses on localized areas.
Whole-Genome Sequencing (WGS)
Whole-Genome Sequencing (WGS)
Sequences all genetic information of an organism including nDNA, mtDNA, cpDNA, coding/non-coding and transcribed/non-transcribed regions.
Whole-Exome Sequencing (WES)
Whole-Exome Sequencing (WES)
Signup and view all the flashcards
Whole-Transcriptome (RNA-Seq)
Whole-Transcriptome (RNA-Seq)
Signup and view all the flashcards
Sequencing
Sequencing
Signup and view all the flashcards
Solid-Phase Amplification
Solid-Phase Amplification
Signup and view all the flashcards
Illumina Sequencing
Illumina Sequencing
Signup and view all the flashcards
Clusters
Clusters
Signup and view all the flashcards
Bridge Amplification
Bridge Amplification
Signup and view all the flashcards
Multiplex Sequencing
Multiplex Sequencing
Signup and view all the flashcards
Library Preparation
Library Preparation
Signup and view all the flashcards
Reads
Reads
Signup and view all the flashcards
Read Mapping
Read Mapping
Signup and view all the flashcards
Variant Calling
Variant Calling
Signup and view all the flashcards
Read Group (RG)
Read Group (RG)
Signup and view all the flashcards
Adapter Ligation
Adapter Ligation
Signup and view all the flashcards
DNA Polymerase
DNA Polymerase
Signup and view all the flashcards
dNTP Incorporation
dNTP Incorporation
Signup and view all the flashcards
Ion Sequencing Sample Prep
Ion Sequencing Sample Prep
Signup and view all the flashcards
Ion Sequencing pH Sensor
Ion Sequencing pH Sensor
Signup and view all the flashcards
Proton to Voltage Conversion
Proton to Voltage Conversion
Signup and view all the flashcards
Ion Sequencing Process
Ion Sequencing Process
Signup and view all the flashcards
Base Call Determination
Base Call Determination
Signup and view all the flashcards
NGS Assay
NGS Assay
Signup and view all the flashcards
DNA Fragment
DNA Fragment
Signup and view all the flashcards
Insert Size
Insert Size
Signup and view all the flashcards
Single-End (SE) Sequencing
Single-End (SE) Sequencing
Signup and view all the flashcards
Paired-End (PE) Sequencing
Paired-End (PE) Sequencing
Signup and view all the flashcards
Sequence Alignment
Sequence Alignment
Signup and view all the flashcards
Advantage of Paired-End Reads
Advantage of Paired-End Reads
Signup and view all the flashcards
Benefits of Paired-End Reads
Benefits of Paired-End Reads
Signup and view all the flashcards
Variant Allele Fraction (VAF)
Variant Allele Fraction (VAF)
Signup and view all the flashcards
Coverage (Sequencing)
Coverage (Sequencing)
Signup and view all the flashcards
Depth of Coverage (DoC)
Depth of Coverage (DoC)
Signup and view all the flashcards
GC Bias
GC Bias
Signup and view all the flashcards
Variants in DNA
Variants in DNA
Signup and view all the flashcards
Next-generation sequencing: Experimental design
Next-generation sequencing: Experimental design
Signup and view all the flashcards
Paired-end reads
Paired-end reads
Signup and view all the flashcards
What really is sequencing?
What really is sequencing?
Signup and view all the flashcards
Study Notes
Module 1: Next Generation Sequencing (NGS)
- Genomics are covered in Bioinformatics Spring 2025.
Overview
- Covers DNA and its composition.
- Discusses what sequencing really is.
- Touches on Sanger sequencing.
- Focuses on Next-generation sequencing: Illumina seq and Ion sequencing.
- Talks about Next-generation sequencing: Experimental design.
- Discusses Paired-end reads
- Touches on the appearance of data.
- Covers Third Generation Sequencing.
- Discusses genetic variation
In Class Exercise: Sequencing so far: Concept Map
- Concept maps visually represent information.
- Focus should be on the interrelatedness of objects with each other.
- Should focus on effortful information retrieval, and supplement the map during study.
Solid-Phase Amplification
- Genomic DNA is prepared by fragmenting and ligating adapters to both ends.
- Single-stranded fragments bind randomly to the flow cell channel's inner surface.
- Enzymes incorporate nucleotides to build double-stranded bridges on the solid-phase substrate.
- Denaturation leaves single-stranded templates anchored to the substrate.
- Unlabeled nucleotides and enzymes initiate solid-phase bridge amplification.
- After the P5 region is cleaved, clusters containing only fragments attached by the P7 region remain, ensuring sequencing in the same direction.
- Millions of double-stranded DNA are generated in each flow cell channel, creating unique "clusters."
- Clusters are all the same DNA, clonally amplified via bridge amplification.
Illumina Sequencing
- Labeled nucleotides are sent over the flow cell for sequencing.
- After the sequence index, Paired-end sequencing is performed.
- The bridge amplification process repeats
- P7 is cut instead of P5.
- Resulting clusters contain fragments attached by the P5 region that creates the opposite Second Read.
The Cost and Output of Illumina Machines
- iSeq output per is 1.2 Gb with a instrument price of $19.9K.
- MiniSeq output per run is 7.5 Gb with a instrument price of $49.5K.
- MiSeq output per run is 15 Gb with a instrument price of $99K.
- NextSeq output per run is 120 Gb with a instrument price of $275K.
- HiSeq output per run is 1.5 Tb with a instrument price of $900K.
- HiSeq X output per run is 1.8 Tb with a instrument price of $6M/$10M.
- NovaSeq output per run is 1 Tb - 6 Tb with a instrument price of $985K.
Remember the H+
- The incorporation of a deoxyribonucleotide triphosphate (dNTP) into an extending DNA strand involves DNA Polymerase.
- This process involves the formation of a covalent bond and releases pyrophosphate and a positively charged hydrogen ion.
Thermo Fisher Ion Sequencing
- Genomic DNA fragments are clonally amplified onto acrylamide beads.
- DNA polymerase and primers are added to each bead-bound template strand, and the beads are delivered to individual wells on sequencing chips (emPCR).
- Each well has a sensor sensitive to pH shifts, converting the release of protons into voltage.
- During sequencing, each of the four nucleotides is added in turn.
- When a single proton is released, it converts to an electrical signal.
- The semiconductor gathers and processes information from each well as nucleotides are added.
- Signals are then processed into base calls.
- Multiple bases can be added to the growing complimentary strand.
High-Level Sequencing Jargon
- NGS is a type of assay to measure many analytes at once.
- The type of NGS assay used depends on the desired information.
- Full genomes or "targeted" regions can be sequenced.
- "De novo" sequencing doesn't require prior knowledge of the target organism and sequences everything potentially.
- Whole-Genome Sequencing (WGS) is an example of de novo sequencing.
- "Targeted" sequencing selects for the localized regions of a genome.
- Whole-Exome Sequencing (WES), Targeted regions (amplicon), and Whole-Transcriptome Sequencing (RNA-Seq) are examples of targeted sequencing.
- Different assays need different library preparations to select for the desired pool of templates.
- Templates are fragmented DNA or PCR amplicons selected for size.
Different Types of NGS Assays
- Whole-Genome sequencing includes all genetic information of an organism.
- Whole-Exome sequencing includes the complete exon with no introns.
- Target Regions sequencing researches key genomic regions.
- Whole-Transcriptome sequencing includes the total RNA molecules can differ in the same individual.
Targeted Regions (Again, it's a type of Assay)
- Targeted assays are far less expensive, enabling use on many individuals.
- Targeted assays can be used for screen, like screening for the cause (OC).
- Allows researchers and clinicians to focus on regions of the genome under investigation.
- Much deeper sequencing allows identifies rare variants missed by CE tools.
- The 395-gene and 120-gene panel are examples of targeted assays.
- Primers are designed for PCR for specific genes, such as KRAS, EGFR, and BRAF.
How does NGS work in Practice?
- Process involves library preparation, sequencing, and the mapping of reads to the genome.
- After reads have been mapped to the genome, the variants discovery and annotated.
Different Types of Experimental Design
- Experimental design types: whole genome and exome
Library Preparation
- The library preparation process typically involves extracting nucleic acids from samples like blood, tissue, or saliva.
- Double-stranded DNA is sheared into fragments through enzymatic or mechanical means.
- Adapter ligation is performed for cluster generation and clonal amplification.
- Protocols vary based on the experimental design (WGS, exome, etc.).
Sequencing the Library
- DNA libraries are deposited on a flow cell and amplified to form clusters, processed by a HTS machine.
- Each lane contains samples (indexed with a DNA barcode).
- A Read Group (RG) combines the sample and the flow cell.
- Next, sequencing occurs by synthesis with reversible terminators.
- Generates enormous piles of reads.
What Was Actually Sequenced (Illumina)?
- Its a piece of fragmented DNA, with two adapter sequences at the end.
- With on insert of 220bp, and performing 100bp PE sequencing, the inner distance is 20 bp.
How Are Reads Aligned?
- Single-end reads align for WGS/WES using the same reference for short fragment sequenced
Aligning the Reads to the Reference Genome
- The reads are easy to sort by chromosome and then position.
- Paired-end reads can help with difficult sorting decisions.
Why Use Paired-End Reads?
- Sequences aligned as read pairs enables accurate read alignment.
- Alignment algorithms can map repeats using algorithms with precise distances.
Once Reads Aligned: High-Level Sequencing Data Jargon
- Coverage (20x) indicates that each base is covered by 20 sequencing reads on average.
- Variant Allele Fraction (VAF) estimates the proportion of reads containing a variant allele.
- Quality scores asses base calling accuracy.
What WGS and WES Reads look like in practice?
- Comparing how Whole genome (WGS) and Exome aligned reads differs over base pairs (bp)
Different Exome Kits Produce Different Analyzable Territory
- Exome kits produce different analyzable territory, which leads to decent and bad coverage differences.
Depth of Coverage with NGS
- Can be Biased (e.g. GC bias or experimental)
- Hard to know what's the true reads mapped.
GC Bias
- Can make GC-rich region poorly covered and unusable Uneven Coverage in different WGS Examples
- Reaches an unevenly distribution coverage with piles of unmapped data due to different DNA and leads to changes.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
This lesson covers genomics in Bioinformatics Spring 2025. It discusses DNA, sequencing, Sanger sequencing, Illumina seq, Ion sequencing, experimental design, paired-end reads, data appearance, third generation sequencing, and genetic variation.