Podcast
Questions and Answers
What does the term 'proteoform' reflect?
What does the term 'proteoform' reflect?
- The number of genes in the genome
- The structure of DNA
- The function of enzymes
- The diversity of proteins (correct)
There are approximately 20,000 genes in the genome and about 1,000 protein variants.
There are approximately 20,000 genes in the genome and about 1,000 protein variants.
False (B)
What is one basic principle of mass spectrometry?
What is one basic principle of mass spectrometry?
Ionization of samples
In mass spectrometry, the __________ is responsible for measuring the time it takes for ions to travel and reach the detector.
In mass spectrometry, the __________ is responsible for measuring the time it takes for ions to travel and reach the detector.
Match the types of mass spectrometry ionization with their correct method:
Match the types of mass spectrometry ionization with their correct method:
What contrast does Systems Biology present compared to reductionist biology?
What contrast does Systems Biology present compared to reductionist biology?
Gene set enrichment analysis (GSEA) is used to discover pathways containing fewer differentially expressed genes than expected by chance.
Gene set enrichment analysis (GSEA) is used to discover pathways containing fewer differentially expressed genes than expected by chance.
What is one important reason why protein modifications are studied in addition to RNA sequencing?
What is one important reason why protein modifications are studied in addition to RNA sequencing?
Systems Biology provides ____ insight not easily achieved by single gene approaches.
Systems Biology provides ____ insight not easily achieved by single gene approaches.
Match the following analytical methods with their purpose:
Match the following analytical methods with their purpose:
Which of the following is a consideration when performing topological analyses of networks?
Which of the following is a consideration when performing topological analyses of networks?
Correlation implies causation in bioinformatics data analysis.
Correlation implies causation in bioinformatics data analysis.
What does GSEA aim to provide as a result?
What does GSEA aim to provide as a result?
What does it mean if a gene is described as being upregulated?
What does it mean if a gene is described as being upregulated?
When analyzing gene expression, it is common to assume that most analytes differ significantly between samples.
When analyzing gene expression, it is common to assume that most analytes differ significantly between samples.
What is the purpose of correction for multiple testing in differential expression analyses?
What is the purpose of correction for multiple testing in differential expression analyses?
A _______ plot is used in differential expression analyses to visualize the significance and magnitude of changes in gene expression.
A _______ plot is used in differential expression analyses to visualize the significance and magnitude of changes in gene expression.
Match the following terms related to gene expression with their definitions:
Match the following terms related to gene expression with their definitions:
Which statement is true about genes expressed in specific tissues?
Which statement is true about genes expressed in specific tissues?
Gene expression analysis involves looking at how individual genes work in isolation.
Gene expression analysis involves looking at how individual genes work in isolation.
What is the role of networks in systems biology?
What is the role of networks in systems biology?
What is the primary advantage of RNA-sequencing over microarrays?
What is the primary advantage of RNA-sequencing over microarrays?
RNA-sequencing requires a reference genome for its analysis.
RNA-sequencing requires a reference genome for its analysis.
What is the typical dynamic range for RNA-sequencing?
What is the typical dynamic range for RNA-sequencing?
The process of filtering low-quality reads in RNA-sequencing is part of the ____ phase.
The process of filtering low-quality reads in RNA-sequencing is part of the ____ phase.
Match the following transcriptomics methods with their characteristics:
Match the following transcriptomics methods with their characteristics:
What is a disadvantage associated with microarrays?
What is a disadvantage associated with microarrays?
Both RNA-sequencing and microarrays have a technical reproducibility greater than 90%.
Both RNA-sequencing and microarrays have a technical reproducibility greater than 90%.
What is a typical analysis step that follows the Quality Control in RNA sequencing?
What is a typical analysis step that follows the Quality Control in RNA sequencing?
In RNA-sequencing, the approximate sensitivity is ____ transcripts per million.
In RNA-sequencing, the approximate sensitivity is ____ transcripts per million.
Which of the following is true regarding the input RNA amount required for RNA-sequencing?
Which of the following is true regarding the input RNA amount required for RNA-sequencing?
What technology was used to quantify plasma proteins in the UK Biobank study?
What technology was used to quantify plasma proteins in the UK Biobank study?
The UK Biobank study identified a total of 14,287 primary associations with environmental factors.
The UK Biobank study identified a total of 14,287 primary associations with environmental factors.
What are protein quantitative trait loci (pQTL)?
What are protein quantitative trait loci (pQTL)?
The primary technique used for metabolomics includes _____ and NMR.
The primary technique used for metabolomics includes _____ and NMR.
Match the following omics techniques with their applications:
Match the following omics techniques with their applications:
What is the goal of using omics techniques in the context of disease?
What is the goal of using omics techniques in the context of disease?
Individual variation and co-variates are not important considerations in data analysis.
Individual variation and co-variates are not important considerations in data analysis.
What are protein-protein interactions (PPI) commonly targeted for?
What are protein-protein interactions (PPI) commonly targeted for?
What is one of the critical challenges in absolute quantification of peptides?
What is one of the critical challenges in absolute quantification of peptides?
Chemical labeling can be used to study multiple samples in a single LC-MS/MS run.
Chemical labeling can be used to study multiple samples in a single LC-MS/MS run.
What is the importance of mapping peptides to proteins in proteomics?
What is the importance of mapping peptides to proteins in proteomics?
The study of _____ proteins can aid drug development efforts.
The study of _____ proteins can aid drug development efforts.
Match the following terms with their descriptions:
Match the following terms with their descriptions:
Which of the following is a potential benefit of using omics approaches in medicine?
Which of the following is a potential benefit of using omics approaches in medicine?
Cancers can be heterogenous and may respond similarly to treatments among subgroups.
Cancers can be heterogenous and may respond similarly to treatments among subgroups.
What is meant by precision medicine?
What is meant by precision medicine?
Flashcards
Global Normalisation
Global Normalisation
A method used to adjust gene expression data between samples, assuming that most genes do not differ significantly. It involves calculating the average expression level across all samples and scaling each sample's data to match this average.
Global LOESS Normalisation
Global LOESS Normalisation
A specific type of global normalisation method that uses a smooth curve (LOESS) to adjust the data across the entire range of expression values.
Differential Expression Analysis
Differential Expression Analysis
A statistical method used to identify genes that show significant differences in expression levels between different conditions (e.g., disease vs. control).
Volcano Plot
Volcano Plot
Signup and view all the flashcards
Multiple Testing Correction
Multiple Testing Correction
Signup and view all the flashcards
Upregulated Gene
Upregulated Gene
Signup and view all the flashcards
Downregulated Gene
Downregulated Gene
Signup and view all the flashcards
Gene Network
Gene Network
Signup and view all the flashcards
Spatial Transcriptomics
Spatial Transcriptomics
Signup and view all the flashcards
RNA-Sequencing
RNA-Sequencing
Signup and view all the flashcards
Microarray
Microarray
Signup and view all the flashcards
Technical Reproducibility
Technical Reproducibility
Signup and view all the flashcards
Data Normalization
Data Normalization
Signup and view all the flashcards
Quality Control (QC)
Quality Control (QC)
Signup and view all the flashcards
Systems Biology
Systems Biology
Signup and view all the flashcards
Principal Component Analysis (PCA)
Principal Component Analysis (PCA)
Signup and view all the flashcards
Gene Dispersion Estimates
Gene Dispersion Estimates
Signup and view all the flashcards
Sensitivity
Sensitivity
Signup and view all the flashcards
Proteoform
Proteoform
Signup and view all the flashcards
Proteome
Proteome
Signup and view all the flashcards
Post-translational modification (PTM)
Post-translational modification (PTM)
Signup and view all the flashcards
Dynamic Range
Dynamic Range
Signup and view all the flashcards
Mass Spectrometry
Mass Spectrometry
Signup and view all the flashcards
Network Representations
Network Representations
Signup and view all the flashcards
Pathway Analysis
Pathway Analysis
Signup and view all the flashcards
Gene Set Enrichment Analysis (GSEA)
Gene Set Enrichment Analysis (GSEA)
Signup and view all the flashcards
Network Analyses
Network Analyses
Signup and view all the flashcards
Genome-scale Models (GSMs)
Genome-scale Models (GSMs)
Signup and view all the flashcards
Topological Analyses
Topological Analyses
Signup and view all the flashcards
Proteomics
Proteomics
Signup and view all the flashcards
How to estimate random hits in proteomics?
How to estimate random hits in proteomics?
Signup and view all the flashcards
How are proteins quantified in mass spectrometry?
How are proteins quantified in mass spectrometry?
Signup and view all the flashcards
What is quantitative proteomics?
What is quantitative proteomics?
Signup and view all the flashcards
How is quantitative proteomics data analyzed?
How is quantitative proteomics data analyzed?
Signup and view all the flashcards
What is the advantage of focusing on a specific part of the proteome?
What is the advantage of focusing on a specific part of the proteome?
Signup and view all the flashcards
What is the role of omics in precision medicine?
What is the role of omics in precision medicine?
Signup and view all the flashcards
How can we subclassify diseases using omics data?
How can we subclassify diseases using omics data?
Signup and view all the flashcards
How can we use omics to guide cancer treatment?
How can we use omics to guide cancer treatment?
Signup and view all the flashcards
Interaction Proteomics
Interaction Proteomics
Signup and view all the flashcards
Pan-cancer Plasma Proteomics
Pan-cancer Plasma Proteomics
Signup and view all the flashcards
Metabolomics
Metabolomics
Signup and view all the flashcards
Protein quantitative trait loci (pQTL)
Protein quantitative trait loci (pQTL)
Signup and view all the flashcards
Biomarkers
Biomarkers
Signup and view all the flashcards
Multi-omics
Multi-omics
Signup and view all the flashcards
Target Discovery
Target Discovery
Signup and view all the flashcards
Biologics Development
Biologics Development
Signup and view all the flashcards
Study Notes
Biopharmaceuticals (2024) - Omics Techniques for Target Identification and Biologics Development
- Omics techniques are used for target discovery
- Omics techniques include genomics, transcriptomics, proteomics, and metabolomics
- Analysis of omics data identifies targets for personalized medicine
- Proteomics focuses on protein applications, with examples used
- Omics helps find relevant targets and develop biologics
- Omics techniques involve a conceptual understanding of processes like sequencing
Lecture Overview
- Omics can be used for target discovery
- Omics techniques (genomics, transcriptomics, proteomics and metabolomics)
- Analysis of omics data identifies targets
- Omics is used for personalized medicine
- Proteomics has specific focus areas and examples of applications
Learning Objectives
- Omics helps in discovering relevant targets and developing biologics
- Conceptual understanding of omics techniques
- Omics and its concepts related to personalized medicine
Omics - Studies of the Entire Collection of a Type of Molecules
- Genomics studies the entire set of genes contained in chromosomes
- Transcriptomics studies the set of mRNA molecules expressed under specific conditions
- Proteomics studies the set of proteins expressed under specific conditions, including modifications
- Metabolomics studies the set of small molecules present under specific conditions
- The central dogma of biology relates DNA, RNA, and proteins
OMICS Glossary
- Genomics: Study of genes in chromosomes
- Transcriptomics: Study of mRNA molecules under specified conditions
- Proteomics: Study of proteins expressed under specified conditions and modifications
- Metabolomics: Study of small molecules at a given time
Omics for Target Identification (Target Selection)
- Identify molecular differences between healthy and diseased states
- Use samples like patient biopsy samples, patient-derived cell lines
- Compare sample groups to distinguish disease variation from normal variation
- Find causative differences to pinpoint suitable targets
Some Techniques for Omics
- Genomics: Whole genome sequencing (WGS), targeted resequencing using NGS, chip-based variant detection, exome sequencing.
- Transcriptomics: RNA-sequencing (NGS), cDNA microarrays, single cell sequencing (NGS).
- Proteomics: Mass spectrometry, affinity-based proteomics.
- Metabolomics: NMR, Mass Spectrometry, Metagenomics (NGS), single-cell omics, Epigenetics (NGS or microarrays), spatial omics.
NGS - Massively Parallel Sequencing
- The cost of sequencing a full human genome has decreased dramatically over time.
Sequencing: How Does It Work?
- Sanger sequencing involves chain termination PCR using fluorescent-labeled ddNTPs and separating fragments based on size
- Second-generation sequencing involves shearing genomic DNA, ligation of adaptors, hybridization to a solid surface, and amplification to form clusters
- Sequencing involves highly sensitive cameras to record the order of lights.
Brief About the Bioinformatics Workflow for NGS Data
- Paired-end reads from NGS data are mapped to a reference genome
- Overlapping reads create contigs, which combined with gaps form scaffolds.
Third-Generation Sequencing (Long-Read Sequencing)
- Longer reads are used (more substantial length than from second-generation sequencing)
- PacBio and Oxford nanopore are common techniques
- Facilitates mapping of reads and isoforms particularly in repeat regions
NGS Applications in Human Health
- Technologies and their analysis in genomics, transcriptomics, epigenomics, metagenomics
- Analysis methods like point mutations, small indels, copy number variations, lineage identification, differential expression, RNA editing, methylation
- Applications from functional effects of mutations to network and pathway analysis and integrative analysis
Genomics
- Genome-level deviations cause diseases (point mutations, indels, copy number variations)
- Associations identified between genotype and phenotype through Genome-wide Association Studies (GWAS).
- To find statistical power, large patient cohorts are needed
Transcriptomics
- Identifies genes that are expressed
Techniques in Transcriptomics
- Transcriptomics technologies (DNA microarrays, RNA-Seq, scRNASeq, spatial transcriptomics)
RNA-Sequencing vs Microarrays
- Technical approaches and processes in RNA-sequencing and microarrays
- Technical comparison and differences between RNA-sequencing and microarrays
Comparison of Contemporary Methods for Bulk Transcriptomics
- A table comparing RNA-Seq and Microarray techniques (throughput, input RNA amount, labor intensity, prior knowledge, quantitation accuracy, sequence resolution, sensitivity, dynamic range, technical reproducibility)
Data Processing of Analysis for Target Discovery Using RNA-Sequencing/Gene Expression Data
- Data quality control (QC) steps in RNA-sequencing, including filtering low-quality reads, alignment, sample & gene level quality control, statistical analysis, systems biology, and enrichment analysis, data visualization
- Need to reduce experimental noise to isolate desired types of variation (biologically relevant)
QC: PCA Analyses for Sample QC
- Principal component analysis (PCA) used to assess the quality of samples
QC: Gene Level Filtering: Gene Dispersion Estimates
- Gene level filtering uses gene dispersion estimates with normalization methods
Data Normalization
- Normalization techniques compensate for differences in sample amounts
Between Sample Normalisation
- Statistical tests and correction (across multiple types of testing) determine significant differences.
- Example use of volcano plot
Differential Expression Analyses
- Differential expression analyses identify differences in gene expression between disease and control conditions
- Analyze genes that are higher or lower between samples from different conditions
- Differentiates between upregulated and downregulated genes
Gene Expression Varies Across Tissues and Conditions (Tissue-specific gene expression)
- Gene expression patterns vary across different tissues and conditions
- Analyze gene expression by looking at samples across tissues to understand more about tissuespecific variations.
- Includes visualization plots
Network Analyses and Systems Biology
- Networks and pathways to analyze overall patterns in data
- Network analysis as representations of complex systems
- Use of networks to study global properties of interacting components
Pathway Analyses, Visualization for Interpretation
- Analyze genes via pathways and interpret results based on integration and existing knowledge
Gene Set Enrichment Analyses
- Statistical methods to find pathways with higher differentially expressed genes than expected by random.
- Method called Gene Set Enrichment Analysis (GSEA)
- A list of significant pathways or gene sets generated.
Network Analyses for Signatures
Genome-Scale Models for Organisms and Tissue Metabolism
- The complexity and needs of large-scale models are discussed
Topological Analyses of Networks
- Describe how graph analyses are used to study interactions
Things to Consider
- The reproducibility and replicability of results are considered
- Important factors like batch effects, randomization, and validation are highlighted
Proteomics
- Proteomics is closer to the phenotype
Why Bother? (Isn't RNA Sequencing Enough?)
- Protein modifications, localization, interactions, and sometimes low correlation between protein and RNA levels need to be considered and studied
Omes and Complexity
- Transcription, alternative splicing, post-translational modifications, interactome, and molecular functions
Challenges
- Large number of genes in the genome, dynamic range of proteins
How to Measure Protein Levels?
- Mass spectrometry and affinity reagents/antibodies
Basic Principles of Mass Spectrometry
- Sample ionization, gas phase ions, ion sorting, detection, mass spectrum
A Typical (Bottom-Up) Mass Spectrometry Proteomic Workflow
- Workflow steps for typical bottom-up mass spectrometry proteomics
The Workhorse - Tandem Mass Spectrometry
- Digestion, ionization, isolation, fragmentation, mass analysis
MS/MS Peptide Fragmentation
- Fragmentation of peptides in MS/MS
MS/MS of Peptide
- Details of MS/MS results of peptide
Search Space Decreased as Trypsin Cleavage Pattern Is Known
- Trypsin cleavage pattern decreases the search space in protein analysis
Match Scoring
- Peptide candidate matches, algorithm assumptions, probabilistic scores
Quantification?
- Relative and absolute quantification in proteomics
What to Do with Quantitative Proteomics Data
- Comparative analysis with transcriptomics, mapping to pathways
Benefits
- Studying relevant proteomes for drug purposes
Omics and Precision Medicine
- Heterogeneity of diseases, omics for stratifying disease, companion diagnostics.
Subclassification of Disease Using Expression Data (Disease Subtypes)
- Analysis of gene expression data of cancers to create subgroups based on responses to treatment
Breast Cancer Example
- Gene expression profiling to classify breast cancer and using the PAM50 classification
- Uses a selected gene panel, and the results are used in treatment decisions
Unsupervised Clustering of Tumor Biopsies
- Clustering using PAM50 proteins and mRNA analysis for tumor biopsies
Dynamic Range Problem
- High natural dynamic range causing challenges for mass spectrometry-based proteomics
Reference Intervals for Protein Analytes in Plasma
- Normal reference ranges of different protein analytes
Sample Fractionation and New Instruments Help
- Proteomics sample fractionation and new instruments
Some Current Figures
- Recent extensive separation and state of the art mass spectrometry techniques used to measure proteins in tissue samples
Affinity Binder-Based Quantification
- Techniques like aptamer-based and antibody-based sequence methods
Recent UK Biobank Example
- Details about a recent UK Biobank quantitative plasma protein study
Pan-Cancer Plasma Proteomics
- Next-generation approaches to pan-cancer blood proteome profiling
Interaction Proteomics
- Interactions among proteins
Interaction Proteomics - SARS-CoV-2 Example
- An example of studying protein-protein interactions for drug repurposing
Metabolomics
- Reflecting the phenotype and analytically challenging to capture all metabolites through common techniques
Target Discovery Followed by Biologics Development
- Interaction partners of biopharmaceuticals; effects of biologics (using omics); omics for finding systemic side effects
Summary
- Omics for molecular differences between sick and healthy, or subgroups
- Critical data analysis
- Considerations for individual variation, co-variates, and complex network of biomolecules
- Choice of techniques depending on disease & availability
- Multi-omics studies to improve precision medicine
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Test your knowledge on systems biology concepts, molecular biology, and mass spectrometry techniques. This quiz covers essential principles such as proteoforms, gene expression analysis, and analytical methods. Challenge yourself with various questions to deepen your understanding of the interconnectedness of biological systems.