Bioinformatics and Amino Acids

Podcast

Listen to an AI-generated conversation about this lesson

Download our mobile app to listen on the go

Get App

Questions and Answers

The term Bioinformatics was first coined by Paulien Hogeweg and Ben ______________ in 1960.

Hesper

The unique ring structure of ______________ can impact protein folding and stability.

Proline

The simplicity and flexibility of ______________ make it a crucial component of protein structures.

Glycine

The ability of ______________ to form disulfide bonds is essential for protein structure stabilization.

Cysteine

Signup and view all the answers

The amphoteric nature and ability to switch between protonation states make ______________ a crucial player in enzyme catalysis and protein function.

Histidine

Signup and view all the answers

Data is raw, unorganized ______ need to be processed.

facts

Signup and view all the answers

When data is processed, organized, structured or presented in a given context so as to make it useful it is called ______.

information

Signup and view all the answers

Primary Databases contain ______ data in its original form.

biomolecular

Signup and view all the answers

Experimental results are submitted directly into the ______ by researchers, and the data are essentially archival in nature.

database

Signup and view all the answers

GenBank is a ______ from NCBI, includes sequences from publically available resources.

database

Signup and view all the answers

EMBL is a ______ acid Database that comes under EBI.

Nucleic

Signup and view all the answers

DDBJ is a biological ______ that collects DNA Sequences.

database

Signup and view all the answers

DDBJ is located at the National Institute of Genetics in the ______ Prefecture of Japan.

Shizuoka

Signup and view all the answers

SWISS – PROT is a curated ______________ sequence database which strives to provide a high level of annotation.

protein

Signup and view all the answers

The Protein Information Resources (PIR) maintains the ______________ Sequence Database (PSD), an annotated protein database containing over 283000 sequences.

Protein

Signup and view all the answers

PROSITE is a protein database that consists of entries describing the protein ______________ , domains and functional sites as well as amino acid patterns and profiles in them.

families

Signup and view all the answers

Pfam is a database of ______________ families that includes their annotations and multiple alignments generated using hidden markov models.

protein

Signup and view all the answers

The general purpose of the pfam database is to provide a complete and accurate classification of ______________ families and domains.

protein

Signup and view all the answers

PDB (protein data bank) comprises of : 1.PDBe ( PDB of ) and 2.PDBj ( PDB of ).

Europe, Japan

Signup and view all the answers

SCOPE is a database that provides a structural classification of ______________.

protein

Signup and view all the answers

CATH is a database that provides a classification of protein structures based on ______________, Architecture, Topology, and Homology.

Class

Signup and view all the answers

Flashcards are hidden until you start studying

Study Notes

Bioinformatics

Bioinformatics is a crucial field that combines computational tools and biological data to analyze and understand biological systems.

History of Bioinformatics

The term "bioinformatics" was first coined by Paulien Hogeweg and Ben Hesper in 1960.

Special Cases of Amino Acids

Proline (Pro): Has a unique ring structure that can impact protein folding and stability.
Glycine (Gly): Is a crucial component of protein structures due to its simplicity and flexibility.
Cysteine (Cys): Forms disulfide bonds that are essential for protein structure stabilization.
Histidine: Has an amphoteric nature and can switch between protonation states, making it crucial for enzyme catalysis and protein function.

Importance of Amino Acids

Amino acids play diverse and essential roles in protein structure, function, and overall biological processes.

Data and Information

Data is raw, unorganized facts that need to be processed.
Information is processed, organized, structured, or presented in a given context to make it useful.

Biological Databases

Classified into two types based on the nature of data and source.

Primary Databases

Contain biomolecular data in its original form.
Experimental results are submitted directly into the database by researchers.
Data is essentially archival in nature and cannot be changed further once a database accession number is assigned.
Examples: GenBank, EMBL, DDBJ for DNA/RNA sequences, and SWISS-PROT and PIR for protein sequences.

GenBank

A database from NCBI that includes sequences from publicly available resources.
A genetic sequence database, an annotated collection of all publicly available DNA sequences.
Part of the International Nucleotide Sequence Database Collaboration.

EMBL

European Molecular Biological Laboratory, a nucleic acid database that comes under EBI (European Bioinformatics Institute).
Established in collaboration with DDBJ and GenBank.
EBI's Sequence Retrieval System (SRS) is a network browser for databanks in molecular biology.

DDBJ

The DNA Data Bank of Japan, a biological database that collects DNA sequences.
Located at the National Institute of Genetics in Japan.
A member of the International Nucleotide Sequence Database Collaboration or INSDC.

SWISS-PROT

A curated protein sequence database that provides high-level annotation.
Annotations include description of protein function, domains, post-translational modifications, variants, etc.
Created in 1986 by Amos Bairoch with Swiss Institute of Bioinformatics.

PIR

The Protein Information Resources (PIR) is an integrated public resource of protein informatics.
Supports genomic and proteomic research and scientific discovery.
Maintains the Protein Sequence Database (PSD), an annotated protein database containing over 283,000 sequences.

Secondary Database

Contains data derived from analyzed primary data.
Data is either manually created or generated automatically.
Contains valuable information such as about mutations or evolutionary relationships.
Examples: PROSITE, Pfam.

PROSITE

A protein database that consists of entries describing protein families, domains, and functional sites.
Manually curated by a team of the Swiss Institute of Bioinformatics.
Tightly integrated into Swiss-Prot protein annotation.

Pfam

A database of protein families that includes their annotations and multiple alignments generated using hidden Markov models.
Provides a complete and accurate classification of protein families and domains.

Structural Database

PDB (Protein Data Bank): comprises of PDBe (PDB of Europe), PDBj (PDB of Japan), SCOPE, and CATH.
Contains information generated from X-Crystallgraphy and NMR Experiments.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Bioinformatics and Amino Acids

Choose a study mode

Podcast

Questions and Answers

The term Bioinformatics was first coined by Paulien Hogeweg and Ben ______________ in 1960.

The unique ring structure of ______________ can impact protein folding and stability.

The simplicity and flexibility of ______________ make it a crucial component of protein structures.

The ability of ______________ to form disulfide bonds is essential for protein structure stabilization.

The amphoteric nature and ability to switch between protonation states make ______________ a crucial player in enzyme catalysis and protein function.

Data is raw, unorganized ______ need to be processed.

When data is processed, organized, structured or presented in a given context so as to make it useful it is called ______.

Primary Databases contain ______ data in its original form.

Experimental results are submitted directly into the ______ by researchers, and the data are essentially archival in nature.

GenBank is a ______ from NCBI, includes sequences from publically available resources.

EMBL is a ______ acid Database that comes under EBI.

DDBJ is a biological ______ that collects DNA Sequences.

DDBJ is located at the National Institute of Genetics in the ______ Prefecture of Japan.

SWISS – PROT is a curated ______________ sequence database which strives to provide a high level of annotation.

The Protein Information Resources (PIR) maintains the ______________ Sequence Database (PSD), an annotated protein database containing over 283000 sequences.

PROSITE is a protein database that consists of entries describing the protein ______________ , domains and functional sites as well as amino acid patterns and profiles in them.

Pfam is a database of ______________ families that includes their annotations and multiple alignments generated using hidden markov models.

The general purpose of the pfam database is to provide a complete and accurate classification of ______________ families and domains.

PDB (protein data bank) comprises of : 1.PDBe ( PDB of ______________ ) and 2.PDBj ( PDB of ______________ ).

SCOPE is a database that provides a structural classification of ______________.

CATH is a database that provides a classification of protein structures based on ______________, Architecture, Topology, and Homology.

Study Notes

Bioinformatics

History of Bioinformatics

Special Cases of Amino Acids

Importance of Amino Acids

Data and Information

Biological Databases

Primary Databases

GenBank

EMBL

DDBJ

SWISS-PROT

PIR

Secondary Database

PROSITE

Pfam

Structural Database

Studying That Suits You

More Like This

Quiz sur la diversité des protéines et des acides aminés

Protein Structure and Proteomics Quiz

Amino Acid pKa Quiz and Flashcards

Bioinformatics Lecture 8: Structural Bioinformatics

PDB (protein data bank) comprises of : 1.PDBe ( PDB of ) and 2.PDBj ( PDB of ).