Podcast
Questions and Answers
The term Bioinformatics was first coined by Paulien Hogeweg and Ben ______________ in 1960.
The term Bioinformatics was first coined by Paulien Hogeweg and Ben ______________ in 1960.
Hesper
The unique ring structure of ______________ can impact protein folding and stability.
The unique ring structure of ______________ can impact protein folding and stability.
Proline
The simplicity and flexibility of ______________ make it a crucial component of protein structures.
The simplicity and flexibility of ______________ make it a crucial component of protein structures.
Glycine
The ability of ______________ to form disulfide bonds is essential for protein structure stabilization.
The ability of ______________ to form disulfide bonds is essential for protein structure stabilization.
Signup and view all the answers
The amphoteric nature and ability to switch between protonation states make ______________ a crucial player in enzyme catalysis and protein function.
The amphoteric nature and ability to switch between protonation states make ______________ a crucial player in enzyme catalysis and protein function.
Signup and view all the answers
Data is raw, unorganized ______ need to be processed.
Data is raw, unorganized ______ need to be processed.
Signup and view all the answers
When data is processed, organized, structured or presented in a given context so as to make it useful it is called ______.
When data is processed, organized, structured or presented in a given context so as to make it useful it is called ______.
Signup and view all the answers
Primary Databases contain ______ data in its original form.
Primary Databases contain ______ data in its original form.
Signup and view all the answers
Experimental results are submitted directly into the ______ by researchers, and the data are essentially archival in nature.
Experimental results are submitted directly into the ______ by researchers, and the data are essentially archival in nature.
Signup and view all the answers
GenBank is a ______ from NCBI, includes sequences from publically available resources.
GenBank is a ______ from NCBI, includes sequences from publically available resources.
Signup and view all the answers
EMBL is a ______ acid Database that comes under EBI.
EMBL is a ______ acid Database that comes under EBI.
Signup and view all the answers
DDBJ is a biological ______ that collects DNA Sequences.
DDBJ is a biological ______ that collects DNA Sequences.
Signup and view all the answers
DDBJ is located at the National Institute of Genetics in the ______ Prefecture of Japan.
DDBJ is located at the National Institute of Genetics in the ______ Prefecture of Japan.
Signup and view all the answers
SWISS – PROT is a curated ______________ sequence database which strives to provide a high level of annotation.
SWISS – PROT is a curated ______________ sequence database which strives to provide a high level of annotation.
Signup and view all the answers
The Protein Information Resources (PIR) maintains the ______________ Sequence Database (PSD), an annotated protein database containing over 283000 sequences.
The Protein Information Resources (PIR) maintains the ______________ Sequence Database (PSD), an annotated protein database containing over 283000 sequences.
Signup and view all the answers
PROSITE is a protein database that consists of entries describing the protein ______________ , domains and functional sites as well as amino acid patterns and profiles in them.
PROSITE is a protein database that consists of entries describing the protein ______________ , domains and functional sites as well as amino acid patterns and profiles in them.
Signup and view all the answers
Pfam is a database of ______________ families that includes their annotations and multiple alignments generated using hidden markov models.
Pfam is a database of ______________ families that includes their annotations and multiple alignments generated using hidden markov models.
Signup and view all the answers
The general purpose of the pfam database is to provide a complete and accurate classification of ______________ families and domains.
The general purpose of the pfam database is to provide a complete and accurate classification of ______________ families and domains.
Signup and view all the answers
PDB (protein data bank) comprises of : 1.PDBe ( PDB of ______________ ) and 2.PDBj ( PDB of ______________ ).
PDB (protein data bank) comprises of : 1.PDBe ( PDB of ______________ ) and 2.PDBj ( PDB of ______________ ).
Signup and view all the answers
SCOPE is a database that provides a structural classification of ______________.
SCOPE is a database that provides a structural classification of ______________.
Signup and view all the answers
CATH is a database that provides a classification of protein structures based on ______________, Architecture, Topology, and Homology.
CATH is a database that provides a classification of protein structures based on ______________, Architecture, Topology, and Homology.
Signup and view all the answers
Study Notes
Bioinformatics
- Bioinformatics is a crucial field that combines computational tools and biological data to analyze and understand biological systems.
History of Bioinformatics
- The term "bioinformatics" was first coined by Paulien Hogeweg and Ben Hesper in 1960.
Special Cases of Amino Acids
- Proline (Pro): Has a unique ring structure that can impact protein folding and stability.
- Glycine (Gly): Is a crucial component of protein structures due to its simplicity and flexibility.
- Cysteine (Cys): Forms disulfide bonds that are essential for protein structure stabilization.
- Histidine: Has an amphoteric nature and can switch between protonation states, making it crucial for enzyme catalysis and protein function.
Importance of Amino Acids
- Amino acids play diverse and essential roles in protein structure, function, and overall biological processes.
Data and Information
- Data is raw, unorganized facts that need to be processed.
- Information is processed, organized, structured, or presented in a given context to make it useful.
Biological Databases
- Classified into two types based on the nature of data and source.
Primary Databases
- Contain biomolecular data in its original form.
- Experimental results are submitted directly into the database by researchers.
- Data is essentially archival in nature and cannot be changed further once a database accession number is assigned.
- Examples: GenBank, EMBL, DDBJ for DNA/RNA sequences, and SWISS-PROT and PIR for protein sequences.
GenBank
- A database from NCBI that includes sequences from publicly available resources.
- A genetic sequence database, an annotated collection of all publicly available DNA sequences.
- Part of the International Nucleotide Sequence Database Collaboration.
EMBL
- European Molecular Biological Laboratory, a nucleic acid database that comes under EBI (European Bioinformatics Institute).
- Established in collaboration with DDBJ and GenBank.
- EBI's Sequence Retrieval System (SRS) is a network browser for databanks in molecular biology.
DDBJ
- The DNA Data Bank of Japan, a biological database that collects DNA sequences.
- Located at the National Institute of Genetics in Japan.
- A member of the International Nucleotide Sequence Database Collaboration or INSDC.
SWISS-PROT
- A curated protein sequence database that provides high-level annotation.
- Annotations include description of protein function, domains, post-translational modifications, variants, etc.
- Created in 1986 by Amos Bairoch with Swiss Institute of Bioinformatics.
PIR
- The Protein Information Resources (PIR) is an integrated public resource of protein informatics.
- Supports genomic and proteomic research and scientific discovery.
- Maintains the Protein Sequence Database (PSD), an annotated protein database containing over 283,000 sequences.
Secondary Database
- Contains data derived from analyzed primary data.
- Data is either manually created or generated automatically.
- Contains valuable information such as about mutations or evolutionary relationships.
- Examples: PROSITE, Pfam.
PROSITE
- A protein database that consists of entries describing protein families, domains, and functional sites.
- Manually curated by a team of the Swiss Institute of Bioinformatics.
- Tightly integrated into Swiss-Prot protein annotation.
Pfam
- A database of protein families that includes their annotations and multiple alignments generated using hidden Markov models.
- Provides a complete and accurate classification of protein families and domains.
Structural Database
- PDB (Protein Data Bank): comprises of PDBe (PDB of Europe), PDBj (PDB of Japan), SCOPE, and CATH.
- Contains information generated from X-Crystallgraphy and NMR Experiments.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Description
Explore the importance of bioinformatics and its connection to amino acids, including their unique characteristics and impact on protein structure.