Podcast
Questions and Answers
Which of the following best describes the term 'Molecular Function' in Gene Ontology?
Which of the following best describes the term 'Molecular Function' in Gene Ontology?
Cellular Component indicates the biological goal or objective of a gene product.
Cellular Component indicates the biological goal or objective of a gene product.
False
What are the two most reliable sources for Gene Ontology annotations?
What are the two most reliable sources for Gene Ontology annotations?
Papers and Experiments
In Gene Ontology, 'Orthologs' are homologues created by __________ and typically have the same function in different species.
In Gene Ontology, 'Orthologs' are homologues created by __________ and typically have the same function in different species.
Signup and view all the answers
What is one of the main uses of Gene Ontology (GO)?
What is one of the main uses of Gene Ontology (GO)?
Signup and view all the answers
Match the following types of protein homologs with their descriptions:
Match the following types of protein homologs with their descriptions:
Signup and view all the answers
What does the term 'HMMs' refer to in the context of protein function prediction?
What does the term 'HMMs' refer to in the context of protein function prediction?
Signup and view all the answers
Human chymotrypsin and bovine chymotrypsin are examples of paralogs.
Human chymotrypsin and bovine chymotrypsin are examples of paralogs.
Signup and view all the answers
What is the identity percentage that PAM 250 was developed to model?
What is the identity percentage that PAM 250 was developed to model?
Signup and view all the answers
BLOSUM62 is derived from aligned sequences of protein families called BLOCKS.
BLOSUM62 is derived from aligned sequences of protein families called BLOCKS.
Signup and view all the answers
What does the letter 'o' represent in the gap penalty formula?
What does the letter 'o' represent in the gap penalty formula?
Signup and view all the answers
The PAM250 matrix quantifies the odds that one residue is mutated from another based on the __________ of amino acid pair exchanging.
The PAM250 matrix quantifies the odds that one residue is mutated from another based on the __________ of amino acid pair exchanging.
Signup and view all the answers
Match the following terms with their definitions:
Match the following terms with their definitions:
Signup and view all the answers
What is one of the main advantages of BLOSUM62?
What is one of the main advantages of BLOSUM62?
Signup and view all the answers
The Needleman-Wunsch algorithm is used for local alignment only.
The Needleman-Wunsch algorithm is used for local alignment only.
Signup and view all the answers
What does the term 'indel' refer to in terms of sequence alignment?
What does the term 'indel' refer to in terms of sequence alignment?
Signup and view all the answers
What is the primary goal of the Needleman-Wunsch algorithm?
What is the primary goal of the Needleman-Wunsch algorithm?
Signup and view all the answers
The Smith-Waterman algorithm focuses on global alignments for sequences.
The Smith-Waterman algorithm focuses on global alignments for sequences.
Signup and view all the answers
What matrix is used for assigning similarity values in the alignment process?
What matrix is used for assigning similarity values in the alignment process?
Signup and view all the answers
The maximum match is the largest number of residues from one sequence that can be matched with another, allowing for all possible __________.
The maximum match is the largest number of residues from one sequence that can be matched with another, allowing for all possible __________.
Signup and view all the answers
Match the alignment algorithms with their characteristics:
Match the alignment algorithms with their characteristics:
Signup and view all the answers
What is the main factor considered when determining pathways in the alignment matrix?
What is the main factor considered when determining pathways in the alignment matrix?
Signup and view all the answers
The maximum match will always be found in the outer row or column of the alignment matrix.
The maximum match will always be found in the outer row or column of the alignment matrix.
Signup and view all the answers
What is introduced when a gap is included in the alignment?
What is introduced when a gap is included in the alignment?
Signup and view all the answers
What is the order of protein structure hierarchy?
What is the order of protein structure hierarchy?
Signup and view all the answers
Glycine is less flexible than Alanine.
Glycine is less flexible than Alanine.
Signup and view all the answers
What type of bond does Cystine form?
What type of bond does Cystine form?
Signup and view all the answers
The equation for free energy of folding is ΔG = ΔH - TΔS, where ΔG represents the ____.
The equation for free energy of folding is ΔG = ΔH - TΔS, where ΔG represents the ____.
Signup and view all the answers
Which amino acid is known for its rigidity due to its unique bonding?
Which amino acid is known for its rigidity due to its unique bonding?
Signup and view all the answers
Van der Waals interactions are energetically unfavorable for protein packing.
Van der Waals interactions are energetically unfavorable for protein packing.
Signup and view all the answers
Name one type of non-bonded interaction in proteins.
Name one type of non-bonded interaction in proteins.
Signup and view all the answers
What defines paralogues?
What defines paralogues?
Signup and view all the answers
Match the following amino acids with their characteristics:
Match the following amino acids with their characteristics:
Signup and view all the answers
Paralogues can have entirely unrelated functions.
Paralogues can have entirely unrelated functions.
Signup and view all the answers
What percentage identity indicates a potential orthologue when searching proteins from another species?
What percentage identity indicates a potential orthologue when searching proteins from another species?
Signup and view all the answers
Specific domain libraries can be searched via __________ at EBI.
Specific domain libraries can be searched via __________ at EBI.
Signup and view all the answers
Which of the following is true about the function transfer between orthologues?
Which of the following is true about the function transfer between orthologues?
Signup and view all the answers
Match the tools with their functionalities:
Match the tools with their functionalities:
Signup and view all the answers
Automated predictions based on homology should rely solely on local matches.
Automated predictions based on homology should rely solely on local matches.
Signup and view all the answers
What is a potential danger of automated predictions based on homology?
What is a potential danger of automated predictions based on homology?
Signup and view all the answers
Which of the following is NOT one of the four most commonly annotated functions covered by flDPnn?
Which of the following is NOT one of the four most commonly annotated functions covered by flDPnn?
Signup and view all the answers
DisProt is recognized as a secondary database for intrinsically disordered proteins.
DisProt is recognized as a secondary database for intrinsically disordered proteins.
Signup and view all the answers
What is the primary goal of Clinical Phase III in drug development?
What is the primary goal of Clinical Phase III in drug development?
Signup and view all the answers
A small molecule identified through biological screening with a desired effect is called a __________.
A small molecule identified through biological screening with a desired effect is called a __________.
Signup and view all the answers
Match the following clinical phases with their purpose:
Match the following clinical phases with their purpose:
Signup and view all the answers
What is the average participant range for Clinical Phase I?
What is the average participant range for Clinical Phase I?
Signup and view all the answers
AlphaFold2 pLDDT scores are used to predict disordered proteins.
AlphaFold2 pLDDT scores are used to predict disordered proteins.
Signup and view all the answers
What is a lead in drug discovery?
What is a lead in drug discovery?
Signup and view all the answers
Study Notes
Bioinformatics - Protein Structure
- Protein Structure Hierarchy: Primary -> Secondary -> Tertiary -> Quaternary
- Protein Backbone: The core structure of proteins, composed of repeating amino acid units.
- Amino Acid Chirality: The Ca is chiral, following the CO-R-N clockwise order for L-form.
- Amino Acid Residues: Different types of amino acids are listed with their abbreviations and side chain characteristics; Glycine is remarkably flexible compared to Alanine due to its lack of a side chain.
- Proline: Has less backbone flexibility due to its covalent bond with amide nitrogen, giving rigidity.
Protein Primary Structure
- Definition: The linear sequence of amino acid residues that comprise a protein.
- Main-chain: The unchanging portion of the protein structure.
- Side-chains: Variable amino acid side chains attached to the main chain.
- Chirality: The amino acid residues are generally L-form.
Protein Secondary Structure
- Alpha Helix: Most common arrangement in protein secondary structure; it has a right-handed helix conformation with interconnecting hydrogen bonds.
- Beta Sheets: Formed from beta strands using hydrogen bonds; adjacent beta-strands can form antiparallel, parallel or mixed arrangements, with the antiparallel arrangement having the strongest inter-strand bonds.
Protein Tertiary Structure
- 3D Structure: Three-dimensional arrangement of a single protein chain with hydrophobic portions in the core.
- Stabilization: Stabilised by hydrophobic interactions, and electrostatic interactions
- Resolution: Determined at near atomic resolution by X-ray crystallography, NMR and electron microscopy.
Protein Quaternary Structure
- Multiple Chains: Generally symmetric arrangement of multiple protein chains,
- Protein Categories: Three categories of proteins: Transmembrane, Globular, and Fibrous.
Thermodynamics of Protein Folding
- ΔG=ΔH-TAS: The Gibbs free energy of folding is the difference of enthalpy ("ΔH") and temperature-dependent entropy ("TAS").
- ΔΗ: Changes in heat (e.g. electrostatics and packing).
- T: Temperature
- ΔS: Entropy, measures randomness/disorder in the system.
Protein Domains
- Structure, Unit, Evolution: Formed from functional parts known as domains, generally a distinct structural and evolutionary unit.
- Classification: Domains are classified into different fold classes (ex. α/α, β/β , α/β) .
Database information
- Protein Data Bank (PDB): A primary database comprising an extensive collection of protein structures.
- Secondary Databases (SCOP, CATH, ECOD): Databases focused on protein family classifications.
Sequence Alignment
- Pairwise Protein Sequence Alignment: Establishing similarity between two sequences using a scoring scheme.
- Scoring Scheme: Assigns a score to identical amino acids (1) and different ones (0) with more advanced schemes including Point Accepted Mutation (PAM).
- Alignment Algorithms: The Needleman-Wunsch and Smith-Waterman algorithms commonly used for sequence alignment.
Database Searching
- BLAST: A widely used algorithm for finding similar sequences from databases.
- PSIBLAST: Enhanced version of BLAST that uses multiple sequences for greater accuracy.
- MMseqs2: A local search program 50x faster than Smith Waterman.
Protein Structure Prediction
- Reasons: To predict structure from sequences understanding how environment dictates this and to guide rational drug design, mutagenesis studies and analysis
- Accuracy: Quantified using RMSD (Root Mean Square Deviation); Useful for close structures. And TM (Template Modelling)
- Template-based prediction: Utilizing similar known structures as templates for comparisons from database(s) to model a new structure, often highly accurate. More generalized models for predicting proteins.
- Ab initio: predicting structure from scratch based on physico-chemical properties of protein.
Protein Docking
- Need: To predict the structure of a complex starting from the unbound components.
- Ab initio: This was the first approach; it involved lots of random pairings/complex evaluations to evaluate which ones are likely most correct to fit together
- Computer programs: Used to perform docking calculations to identify and refine solutions
- Template approach: This is using known protein-protein complex structures as templates.
Protein Function
- Gene Ontology (GO): Controlled vocabulary to describe gene product functions.
- Types: Molecular Function, Biological Process, Cellular Component
- Approaches: Homology (searching) and structure-based predictions to infer function.
- Domains/Motifs: Use known function of conserved domains/motifs to predict the function of a new sequence.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Test your knowledge on Gene Ontology, focusing on molecular functions, protein homologs, and computational methods for function predictions. This quiz covers essential concepts and reliable sources pertaining to Gene Ontology annotations.