Podcast
Questions and Answers
Which activity best describes bioinformatics regarding medical or biological data?
Which activity best describes bioinformatics regarding medical or biological data?
- Acquiring, storing, visualizing, and interpreting. (correct)
- Storing only biological data for academic research.
- Primarily focused on the development of medical devices.
- Visualization and interpretation of medical data only.
What do the underlying principles of bioinformatics involve?
What do the underlying principles of bioinformatics involve?
- Solely computational methods.
- Combining computational and experimental approaches. (correct)
- Focusing on data visualization techniques.
- Exclusively experimental approaches.
When did the exponential increase in molecular sequence data begin?
When did the exponential increase in molecular sequence data begin?
- Early 1980s. (correct)
- Early 1960s.
- Early 1990s.
- Early 1970s.
Which factor significantly transformed access to data and publications related to bioinformatics?
Which factor significantly transformed access to data and publications related to bioinformatics?
Which application did bioinformatics significantly contribute to regarding the CRISPR-Cas system?
Which application did bioinformatics significantly contribute to regarding the CRISPR-Cas system?
What is the purpose of using bioinformatics in translational drug discovery?
What is the purpose of using bioinformatics in translational drug discovery?
Which is the primary goal of identifying molecular features in cancer therapy?
Which is the primary goal of identifying molecular features in cancer therapy?
How did the advancements in sequencing technologies shift molecular genetic pathology?
How did the advancements in sequencing technologies shift molecular genetic pathology?
What has been enabled by the increased capabilities and reduced costs of sequencers?
What has been enabled by the increased capabilities and reduced costs of sequencers?
In clinical genomic sequencing, what is the main focus of primary analysis?
In clinical genomic sequencing, what is the main focus of primary analysis?
What does secondary sequence analysis involve?
What does secondary sequence analysis involve?
What is the purpose of variant calling?
What is the purpose of variant calling?
What is the goal of tertiary analysis?
What is the goal of tertiary analysis?
What key analysis does annotation involve?
What key analysis does annotation involve?
In Bioinformatics what is meant by the term, reference genome
?
In Bioinformatics what is meant by the term, reference genome
?
What is the primary function of the Variation Viewer tool?
What is the primary function of the Variation Viewer tool?
What does the acronym HGVS stand for as it relates to Databases for Variant Annotation?
What does the acronym HGVS stand for as it relates to Databases for Variant Annotation?
Which of the following best describes the role of bioinformatics pipelines in clinical genomic practice?
Which of the following best describes the role of bioinformatics pipelines in clinical genomic practice?
Which of the following is an example of an advanced computational bioinformatics technique?
Which of the following is an example of an advanced computational bioinformatics technique?
Which of the following is an example of a pioneering intitution in bioinformatics?
Which of the following is an example of a pioneering intitution in bioinformatics?
What occurs in the Bioinformatics stage of Development
according to the Drug discovery pipeline?
What occurs in the Bioinformatics stage of Development
according to the Drug discovery pipeline?
What aspect of bioinformatics deals directly with identifying translational opportunities?
What aspect of bioinformatics deals directly with identifying translational opportunities?
According to table 5, which dataset is not described as providing information about Population frecuency?
According to table 5, which dataset is not described as providing information about Population frecuency?
Clinical genomics would depend the least on what process to asses its effects?
Clinical genomics would depend the least on what process to asses its effects?
What kind of files work within an interpret data.
What kind of files work within an interpret data.
Which of the following annotations is available through Alamut?
Which of the following annotations is available through Alamut?
What type of variants has the dbSNP in annotation dataset?
What type of variants has the dbSNP in annotation dataset?
GenBank, EMBL (European Molecular Biology Laboratory nucleotide sequence database).?
GenBank, EMBL (European Molecular Biology Laboratory nucleotide sequence database).?
What information does HGMD Database provide in Datasets and Databases for Variant Annotation?
What information does HGMD Database provide in Datasets and Databases for Variant Annotation?
Flashcards
Bioinformatics
Bioinformatics
A discipline handling genetic information using computational tools to acquire, store, visualize, and interpret medical or biological data.
Growth of Sequence Data
Growth of Sequence Data
The exponential increase in molecular sequence data starting in the 1980s due to accessible DNA sequencing methods.
EMBL
EMBL
A biological database that stores and provides access to nucleotide sequence data.
Artemis Comparison Tool (ACT)
Artemis Comparison Tool (ACT)
Signup and view all the flashcards
Bioinformatics in Translational Drug Discovery
Bioinformatics in Translational Drug Discovery
Signup and view all the flashcards
Bioinformatics Role in Drug Discovery
Bioinformatics Role in Drug Discovery
Signup and view all the flashcards
Primary Analysis
Primary Analysis
Signup and view all the flashcards
Secondary Analysis
Secondary Analysis
Signup and view all the flashcards
Reference Genome
Reference Genome
Signup and view all the flashcards
Tertiary Analysis
Tertiary Analysis
Signup and view all the flashcards
ClinVar
ClinVar
Signup and view all the flashcards
What is Primary analysis?
What is Primary analysis?
Signup and view all the flashcards
Variant calling
Variant calling
Signup and view all the flashcards
Study Notes
- Bioinformatics is a discipline for handling genetic information.
- It involves computational tools and approaches to acquire, store, visualize, and interpret medical or biological data.
- Bioinformatics requires advanced computational methods to process large datasets.
- These methods are used to form decision-support systems for complex problems in medicine, biology, and related fields.
Historical Context
- The term "bioinformatics" has been in the scientific lexicon for around two decades.
- The underlying principles of bioinformatics come from studies that combined computational and experimental approaches.
Pioneering Institutions
- Here are ten institutions that pioneered and fostered computation in biology:
- Birkbeck College, University of London (UK)
- Boston University (USA)
- European Molecular Biology Laboratory (EMBL) (DE and EMBL)
- Institute of Protein Research, Academy of Sciences (Former USSR)
- Laboratory of Molecular Biology (LMB), MRC (UK)
- Los Alamos National Laboratory (LANL) (USA)
- National Biomedical Research Foundation (USA)
- Stanford University (USA)
- University of California San Francisco (UCSF) (USA)
- University College, University of London (UCL) (UK)
Sequencing and Bioinformatics
- The wide availability of methods for DNA sequencing in the early 1980s led to exponential growth in molecular sequence data
- Data was stored in databases like GenBank and EMBL (European Molecular Biology Laboratory nucleotide sequence database).
- The arrival of the Internet transformed databases and access to data, publications, and information infrastructure
Bioinformatics Applications:
- CRISPR-Cas bioinformatics:
- Bioinformatics played a key role in the discovery of the CRISPR-Cas system and identification of new functions.
- Yoshizumi Ishino described the signature repeat-spacer architecture of CRISPR arrays in 1987.
- Francisco Mojica, aided by bioinformatics, showed CRISPR arrays are present in Escherichia coli, most archaeal, and many bacterial genomes.
Bioinformatics tools:
- Artemis Comparison Tool (ACT):
- ACT identifies and analyzes regions of similarity and difference between genomes.
- It explores the conservation of synteny in the context of the entire sequences and their annotation.
- ACT reads complete EMBL, GENBANK, and GFF entries, and FASTA or raw format sequences.
Bioinformatics in Translational Drug Discovery
- Disease-based bioinformatics approaches in translational drug discovery depend on the disease type.
- They implement different strategies to analyze cancer, genetic, and infectious diseases.
- Bioinformatics approaches help identify key drivers of cancer in patients, for personalized cancer therapy.
Bioinformatics in Clinical Genomic Sequencing
- High-throughput sequencing technologies have transformed molecular genetic pathology into an informatics-driven specialty.
- Increasing capabilities and reduced costs have made large-scale sequencing (exomes and genomes) accessible to many patients.
Sequencing Technologies
- First Generation: Sanger Sequencing (500-1000 bp fragments)
- Second Generation: 454, Solexa, Ion Torrent, Illumina (~50-500 bp fragments)
- Third Generation: PacBio, Oxford Nanopore (Tens of kb fragments on average)
Clinical Bioinformatics Process:
- Primary Analysis: Algorithms convert raw sequence reads to a string of As, Cs, Ts, and Gs.
- Secondary Analysis:
- Mapping or aligning sequence reads onto the reference genomic sequence.
- Variant calling identifies differences between the individual's sequence and the reference genome.
- A reference genome is a digital nucleic acid sequence database, a representative example of the set of genes.
- Variation Viewer allows viewing, searching, and navigating variations in dbSNP, dbVar, and ClinVar.
- Tertiary Analysis: Sequence variants are placed in the genomic and clinical context.
- Annotation: Variants are annotated to assess their effects on biology, physiology, and clinical relevance. This involves extracting knowledge at the gene, transcript, protein, and regulatory levels from biological databases.
Datasets and Databases for Variant Annotation
- dbSNP: Central repository for SNP/indel polymorphisms.
- Human Gene Mutation Database (HGMD): Repository for gene lesions responsible for human inherited disease.
- ClinVar: Repository of human variants related to phenotypes with supporting evidence.
- LOVD: Repository of gene-centered DNA variations, storage of patient-centered data and NGS data.
- 1000 Genomes: Project to find genetic variants with frequencies of ≥ 1%.
- gnomAD: Resource aggregating exome and genome sequencing data from large-scale sequencing projects.
- ExAC: Same as gnomAD.
- Alamut: Database that integrates NCBI, EBI, UCSC, HGVS nomenclature.
- dbNSFP: Database of human nonsynonymous SNPs and their functional predictions.
Conclusion
- Genomic sequencing is increasingly part of clinical practice.
- Understanding bioinformatics tools' purposes, outputs, and assumptions is crucial for molecular geneticists/pathologists.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.