L6 | Introduction to Bioinformatics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Which activity best describes bioinformatics regarding medical or biological data?

  • Acquiring, storing, visualizing, and interpreting. (correct)
  • Storing only biological data for academic research.
  • Primarily focused on the development of medical devices.
  • Visualization and interpretation of medical data only.

What do the underlying principles of bioinformatics involve?

  • Solely computational methods.
  • Combining computational and experimental approaches. (correct)
  • Focusing on data visualization techniques.
  • Exclusively experimental approaches.

When did the exponential increase in molecular sequence data begin?

  • Early 1980s. (correct)
  • Early 1960s.
  • Early 1990s.
  • Early 1970s.

Which factor significantly transformed access to data and publications related to bioinformatics?

<p>The arrival of the Internet. (D)</p> Signup and view all the answers

Which application did bioinformatics significantly contribute to regarding the CRISPR-Cas system?

<p>Identifying new functions of the system. (C)</p> Signup and view all the answers

What is the purpose of using bioinformatics in translational drug discovery?

<p>To enable a more personalized approach to cancer therapy. (C)</p> Signup and view all the answers

Which is the primary goal of identifying molecular features in cancer therapy?

<p>To provide new therapeutic opportunities based on molecular characteristics. (A)</p> Signup and view all the answers

How did the advancements in sequencing technologies shift molecular genetic pathology?

<p>From a visual-based specialty to an informatics-driven specialty. (D)</p> Signup and view all the answers

What has been enabled by the increased capabilities and reduced costs of sequencers?

<p>Large-scale sequencing accessibility. (C)</p> Signup and view all the answers

In clinical genomic sequencing, what is the main focus of primary analysis?

<p>Converting raw sequence reads into a string of As, Cs, Ts, and Gs. (C)</p> Signup and view all the answers

What does secondary sequence analysis involve?

<p>Mapping sequence reads onto a reference genome. (A)</p> Signup and view all the answers

What is the purpose of variant calling?

<p>Identify sequence variations. (B)</p> Signup and view all the answers

What is the goal of tertiary analysis?

<p>To interpret the exact difference between sequence variations. (D)</p> Signup and view all the answers

What key analysis does annotation involve?

<p>Access to effects and biological context. (D)</p> Signup and view all the answers

In Bioinformatics what is meant by the term, reference genome?

<p>A digital representation assembled as a representative example of a set of genes. (C)</p> Signup and view all the answers

What is the primary function of the Variation Viewer tool?

<p>To view, search and navigate variations housed in dbSNP, dbVar and ClinVar. (C)</p> Signup and view all the answers

What does the acronym HGVS stand for as it relates to Databases for Variant Annotation?

<p>Human Genome Variation Society (C)</p> Signup and view all the answers

Which of the following best describes the role of bioinformatics pipelines in clinical genomic practice?

<p>They automate interpretation and reporting of genetic data. (A)</p> Signup and view all the answers

Which of the following is an example of an advanced computational bioinformatics technique?

<p>Advanced Data set processing for large data. (D)</p> Signup and view all the answers

Which of the following is an example of a pioneering intitution in bioinformatics?

<p>The Sanger institute. (A)</p> Signup and view all the answers

What occurs in the Bioinformatics stage of Development according to the Drug discovery pipeline?

<p>Toxicity and Omics Data. (A)</p> Signup and view all the answers

What aspect of bioinformatics deals directly with identifying translational opportunities?

<p>Personalized medicine. (A)</p> Signup and view all the answers

According to table 5, which dataset is not described as providing information about Population frecuency?

<p>HOMD (C)</p> Signup and view all the answers

Clinical genomics would depend the least on what process to asses its effects?

<p>Manual (C)</p> Signup and view all the answers

What kind of files work within an interpret data.

<p>Xls. (B)</p> Signup and view all the answers

Which of the following annotations is available through Alamut?

<p>Functional prediction. (C)</p> Signup and view all the answers

What type of variants has the dbSNP in annotation dataset?

<p>SNP/indel polymorphisms. (A)</p> Signup and view all the answers

GenBank, EMBL (European Molecular Biology Laboratory nucleotide sequence database).?

<p>Store sequence data. (A)</p> Signup and view all the answers

What information does HGMD Database provide in Datasets and Databases for Variant Annotation?

<p>Gene symbol, cDNA sequence, coordinates, HGVS nomenclature (C)</p> Signup and view all the answers

Flashcards

Bioinformatics

A discipline handling genetic information using computational tools to acquire, store, visualize, and interpret medical or biological data.

Growth of Sequence Data

The exponential increase in molecular sequence data starting in the 1980s due to accessible DNA sequencing methods.

EMBL

A biological database that stores and provides access to nucleotide sequence data.

Artemis Comparison Tool (ACT)

A bioinformatics tool used to identify similarities and differences between genomes, exploring conservation of synteny.

Signup and view all the flashcards

Bioinformatics in Translational Drug Discovery

Using bioinformatics to understand the molecular drivers of cancer to enable personalized treatment approaches.

Signup and view all the flashcards

Bioinformatics Role in Drug Discovery

Application of bioinformatics to analyze data and optimize strategies in drug development across various stages.

Signup and view all the flashcards

Primary Analysis

The initial processing step in clinical genomic sequencing, converting raw sequence data into strings of As, Cs, Ts, and Gs.

Signup and view all the flashcards

Secondary Analysis

A process involving mapping sequence reads onto a reference genome to identify differences and variations.

Signup and view all the flashcards

Reference Genome

A digital nucleic acid sequence database assembled by scientists as a representative example of the genes of a species.

Signup and view all the flashcards

Tertiary Analysis

Placing identified sequence variants into a genomic and clinical context to interpret their relevance.

Signup and view all the flashcards

ClinVar

A database with human variants related to phenotypes, integrated from multiple sources to provide supporting evidence.

Signup and view all the flashcards

What is Primary analysis?

Algorithms that are associated to the sequencing machines that convert the raw sequence reads to a string of As, Cs, Ts, and Gs.

Signup and view all the flashcards

Variant calling

Variant calling is likely the biggest black box and most reliant on statistical modeling

Signup and view all the flashcards

Study Notes

  • Bioinformatics is a discipline for handling genetic information.
  • It involves computational tools and approaches to acquire, store, visualize, and interpret medical or biological data.
  • Bioinformatics requires advanced computational methods to process large datasets.
  • These methods are used to form decision-support systems for complex problems in medicine, biology, and related fields.

Historical Context

  • The term "bioinformatics" has been in the scientific lexicon for around two decades.
  • The underlying principles of bioinformatics come from studies that combined computational and experimental approaches.

Pioneering Institutions

  • Here are ten institutions that pioneered and fostered computation in biology:
    • Birkbeck College, University of London (UK)
    • Boston University (USA)
    • European Molecular Biology Laboratory (EMBL) (DE and EMBL)
    • Institute of Protein Research, Academy of Sciences (Former USSR)
    • Laboratory of Molecular Biology (LMB), MRC (UK)
    • Los Alamos National Laboratory (LANL) (USA)
    • National Biomedical Research Foundation (USA)
    • Stanford University (USA)
    • University of California San Francisco (UCSF) (USA)
    • University College, University of London (UCL) (UK)

Sequencing and Bioinformatics

  • The wide availability of methods for DNA sequencing in the early 1980s led to exponential growth in molecular sequence data
  • Data was stored in databases like GenBank and EMBL (European Molecular Biology Laboratory nucleotide sequence database).
  • The arrival of the Internet transformed databases and access to data, publications, and information infrastructure

Bioinformatics Applications:

  • CRISPR-Cas bioinformatics:
    • Bioinformatics played a key role in the discovery of the CRISPR-Cas system and identification of new functions.
    • Yoshizumi Ishino described the signature repeat-spacer architecture of CRISPR arrays in 1987.
    • Francisco Mojica, aided by bioinformatics, showed CRISPR arrays are present in Escherichia coli, most archaeal, and many bacterial genomes.

Bioinformatics tools:

  • Artemis Comparison Tool (ACT):
    • ACT identifies and analyzes regions of similarity and difference between genomes.
    • It explores the conservation of synteny in the context of the entire sequences and their annotation.
    • ACT reads complete EMBL, GENBANK, and GFF entries, and FASTA or raw format sequences.

Bioinformatics in Translational Drug Discovery

  • Disease-based bioinformatics approaches in translational drug discovery depend on the disease type.
  • They implement different strategies to analyze cancer, genetic, and infectious diseases.
  • Bioinformatics approaches help identify key drivers of cancer in patients, for personalized cancer therapy.

Bioinformatics in Clinical Genomic Sequencing

  • High-throughput sequencing technologies have transformed molecular genetic pathology into an informatics-driven specialty.
  • Increasing capabilities and reduced costs have made large-scale sequencing (exomes and genomes) accessible to many patients.

Sequencing Technologies

  • First Generation: Sanger Sequencing (500-1000 bp fragments)
  • Second Generation: 454, Solexa, Ion Torrent, Illumina (~50-500 bp fragments)
  • Third Generation: PacBio, Oxford Nanopore (Tens of kb fragments on average)

Clinical Bioinformatics Process:

  • Primary Analysis: Algorithms convert raw sequence reads to a string of As, Cs, Ts, and Gs.
  • Secondary Analysis:
    • Mapping or aligning sequence reads onto the reference genomic sequence.
    • Variant calling identifies differences between the individual's sequence and the reference genome.
  • A reference genome is a digital nucleic acid sequence database, a representative example of the set of genes.
  • Variation Viewer allows viewing, searching, and navigating variations in dbSNP, dbVar, and ClinVar.
  • Tertiary Analysis: Sequence variants are placed in the genomic and clinical context.
  • Annotation: Variants are annotated to assess their effects on biology, physiology, and clinical relevance. This involves extracting knowledge at the gene, transcript, protein, and regulatory levels from biological databases.

Datasets and Databases for Variant Annotation

  • dbSNP: Central repository for SNP/indel polymorphisms.
  • Human Gene Mutation Database (HGMD): Repository for gene lesions responsible for human inherited disease.
  • ClinVar: Repository of human variants related to phenotypes with supporting evidence.
  • LOVD: Repository of gene-centered DNA variations, storage of patient-centered data and NGS data.
  • 1000 Genomes: Project to find genetic variants with frequencies of ≥ 1%.
  • gnomAD: Resource aggregating exome and genome sequencing data from large-scale sequencing projects.
  • ExAC: Same as gnomAD.
  • Alamut: Database that integrates NCBI, EBI, UCSC, HGVS nomenclature.
  • dbNSFP: Database of human nonsynonymous SNPs and their functional predictions.

Conclusion

  • Genomic sequencing is increasingly part of clinical practice.
  • Understanding bioinformatics tools' purposes, outputs, and assumptions is crucial for molecular geneticists/pathologists.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Bioinformatics and Computational Biology Quiz
5 questions
Introduction to Bioinformatics
10 questions

Introduction to Bioinformatics

HospitableJuxtaposition avatar
HospitableJuxtaposition
Introduction to Bioinformatics
10 questions

Introduction to Bioinformatics

CourageousBambooFlute avatar
CourageousBambooFlute
MIT 7.91J: Computational and Systems Biology
20 questions
Use Quizgecko on...
Browser
Browser