Bioinformatics and Amino Acids

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Listen to an AI-generated conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

The term Bioinformatics was first coined by Paulien Hogeweg and Ben ______________ in 1960.

Hesper

The unique ring structure of ______________ can impact protein folding and stability.

Proline

The simplicity and flexibility of ______________ make it a crucial component of protein structures.

Glycine

The ability of ______________ to form disulfide bonds is essential for protein structure stabilization.

<p>Cysteine</p>
Signup and view all the answers

The amphoteric nature and ability to switch between protonation states make ______________ a crucial player in enzyme catalysis and protein function.

<p>Histidine</p>
Signup and view all the answers

Data is raw, unorganized ______ need to be processed.

<p>facts</p>
Signup and view all the answers

When data is processed, organized, structured or presented in a given context so as to make it useful it is called ______.

<p>information</p>
Signup and view all the answers

Primary Databases contain ______ data in its original form.

<p>biomolecular</p>
Signup and view all the answers

Experimental results are submitted directly into the ______ by researchers, and the data are essentially archival in nature.

<p>database</p>
Signup and view all the answers

GenBank is a ______ from NCBI, includes sequences from publically available resources.

<p>database</p>
Signup and view all the answers

EMBL is a ______ acid Database that comes under EBI.

<p>Nucleic</p>
Signup and view all the answers

DDBJ is a biological ______ that collects DNA Sequences.

<p>database</p>
Signup and view all the answers

DDBJ is located at the National Institute of Genetics in the ______ Prefecture of Japan.

<p>Shizuoka</p>
Signup and view all the answers

SWISS – PROT is a curated ______________ sequence database which strives to provide a high level of annotation.

<p>protein</p>
Signup and view all the answers

The Protein Information Resources (PIR) maintains the ______________ Sequence Database (PSD), an annotated protein database containing over 283000 sequences.

<p>Protein</p>
Signup and view all the answers

PROSITE is a protein database that consists of entries describing the protein ______________ , domains and functional sites as well as amino acid patterns and profiles in them.

<p>families</p>
Signup and view all the answers

Pfam is a database of ______________ families that includes their annotations and multiple alignments generated using hidden markov models.

<p>protein</p>
Signup and view all the answers

The general purpose of the pfam database is to provide a complete and accurate classification of ______________ families and domains.

<p>protein</p>
Signup and view all the answers

PDB (protein data bank) comprises of : 1.PDBe ( PDB of ______________ ) and 2.PDBj ( PDB of ______________ ).

<p>Europe, Japan</p>
Signup and view all the answers

SCOPE is a database that provides a structural classification of ______________.

<p>protein</p>
Signup and view all the answers

CATH is a database that provides a classification of protein structures based on ______________, Architecture, Topology, and Homology.

<p>Class</p>
Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes

Bioinformatics

  • Bioinformatics is a crucial field that combines computational tools and biological data to analyze and understand biological systems.

History of Bioinformatics

  • The term "bioinformatics" was first coined by Paulien Hogeweg and Ben Hesper in 1960.

Special Cases of Amino Acids

  • Proline (Pro): Has a unique ring structure that can impact protein folding and stability.
  • Glycine (Gly): Is a crucial component of protein structures due to its simplicity and flexibility.
  • Cysteine (Cys): Forms disulfide bonds that are essential for protein structure stabilization.
  • Histidine: Has an amphoteric nature and can switch between protonation states, making it crucial for enzyme catalysis and protein function.

Importance of Amino Acids

  • Amino acids play diverse and essential roles in protein structure, function, and overall biological processes.

Data and Information

  • Data is raw, unorganized facts that need to be processed.
  • Information is processed, organized, structured, or presented in a given context to make it useful.

Biological Databases

  • Classified into two types based on the nature of data and source.

Primary Databases

  • Contain biomolecular data in its original form.
  • Experimental results are submitted directly into the database by researchers.
  • Data is essentially archival in nature and cannot be changed further once a database accession number is assigned.
  • Examples: GenBank, EMBL, DDBJ for DNA/RNA sequences, and SWISS-PROT and PIR for protein sequences.

GenBank

  • A database from NCBI that includes sequences from publicly available resources.
  • A genetic sequence database, an annotated collection of all publicly available DNA sequences.
  • Part of the International Nucleotide Sequence Database Collaboration.

EMBL

  • European Molecular Biological Laboratory, a nucleic acid database that comes under EBI (European Bioinformatics Institute).
  • Established in collaboration with DDBJ and GenBank.
  • EBI's Sequence Retrieval System (SRS) is a network browser for databanks in molecular biology.

DDBJ

  • The DNA Data Bank of Japan, a biological database that collects DNA sequences.
  • Located at the National Institute of Genetics in Japan.
  • A member of the International Nucleotide Sequence Database Collaboration or INSDC.

SWISS-PROT

  • A curated protein sequence database that provides high-level annotation.
  • Annotations include description of protein function, domains, post-translational modifications, variants, etc.
  • Created in 1986 by Amos Bairoch with Swiss Institute of Bioinformatics.

PIR

  • The Protein Information Resources (PIR) is an integrated public resource of protein informatics.
  • Supports genomic and proteomic research and scientific discovery.
  • Maintains the Protein Sequence Database (PSD), an annotated protein database containing over 283,000 sequences.

Secondary Database

  • Contains data derived from analyzed primary data.
  • Data is either manually created or generated automatically.
  • Contains valuable information such as about mutations or evolutionary relationships.
  • Examples: PROSITE, Pfam.

PROSITE

  • A protein database that consists of entries describing protein families, domains, and functional sites.
  • Manually curated by a team of the Swiss Institute of Bioinformatics.
  • Tightly integrated into Swiss-Prot protein annotation.

Pfam

  • A database of protein families that includes their annotations and multiple alignments generated using hidden Markov models.
  • Provides a complete and accurate classification of protein families and domains.

Structural Database

  • PDB (Protein Data Bank): comprises of PDBe (PDB of Europe), PDBj (PDB of Japan), SCOPE, and CATH.
  • Contains information generated from X-Crystallgraphy and NMR Experiments.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team
Use Quizgecko on...
Browser
Browser