Podcast
Questions and Answers
What does the blastp results page illustrate?
What does the blastp results page illustrate?
- The alignment of query sequence with other sequences in the database (correct)
- The process of designing experiments
- The function of specific proteins
- The 3D structure of corn alpha-amylase
What does the top red bar in the diagrammatic representation of the blastp results page represent?
What does the top red bar in the diagrammatic representation of the blastp results page represent?
- The query sequence (correct)
- The conserved sequence regions
- The hit sequences in the database
- The protein families
What is the primary function of primary databases?
What is the primary function of primary databases?
- To provide a consensus sequence from multiple experiments
- To separate the wheat from the chaff in a sequence search
- To identify conserved domains within a protein
- To store experimental results in an accessible format (correct)
What do the two scores at the right of the blastp results page indicate?
What do the two scores at the right of the blastp results page indicate?
What is the purpose of multiple alignments of protein sequences?
What is the purpose of multiple alignments of protein sequences?
What is the purpose of Entrez?
What is the purpose of Entrez?
What is the abbreviation for European Molecular Biology Laboratory?
What is the abbreviation for European Molecular Biology Laboratory?
What is the definition of alignment in the context of bioinformatics?
What is the definition of alignment in the context of bioinformatics?
What is a bioinformatics?
What is a bioinformatics?
What is the purpose of a sequence search in Entrez?
What is the purpose of a sequence search in Entrez?
What is the URL of the NCBI website?
What is the URL of the NCBI website?
What is an algorithm in the context of bioinformatics?
What is an algorithm in the context of bioinformatics?
What is the purpose of Clustal Omega?
What is the purpose of Clustal Omega?
What is the purpose of secondary databases?
What is the purpose of secondary databases?
What is the name of the tool used for basic local alignment search?
What is the name of the tool used for basic local alignment search?
What is the first step in using Entrez?
What is the first step in using Entrez?
What is the purpose of introducing gaps in an alignment?
What is the purpose of introducing gaps in an alignment?
What is the primary goal of proteomics?
What is the primary goal of proteomics?
What is the term for a short conserved region in a protein sequence?
What is the term for a short conserved region in a protein sequence?
What is the term for a discrete portion of a protein that folds independently?
What is the term for a discrete portion of a protein that folds independently?
What is the term for the input sequence used to compare with all entries in a database?
What is the term for the input sequence used to compare with all entries in a database?
What is the term for the alignment of two nucleic acid or protein sequences over their entire length?
What is the term for the alignment of two nucleic acid or protein sequences over their entire length?
What is the main function of the PubMed link on the NCBI Home Page?
What is the main function of the PubMed link on the NCBI Home Page?
Which of the following BLAST programs compares a nucleotide query sequence against a protein sequence database?
Which of the following BLAST programs compares a nucleotide query sequence against a protein sequence database?
What is the purpose of the tbastx program?
What is the purpose of the tbastx program?
Where can you access the BLAST page from?
Where can you access the BLAST page from?
What is the main difference between the blastn and blastp programs?
What is the main difference between the blastn and blastp programs?
Which of the following is NOT a type of BLAST search?
Which of the following is NOT a type of BLAST search?
What is the purpose of the blastp program?
What is the purpose of the blastp program?
Why can't the tblastx program be used with the nr database on the BLAST Web page?
Why can't the tblastx program be used with the nr database on the BLAST Web page?
Flashcards
Primary Databases
Primary Databases
Databases containing experimental results; not consensus sequences.
Secondary Databases
Secondary Databases
Databases that use primary databases as sources, focusing on consensus sequences.
DDBJ
DDBJ
DNA Databank of Japan, a primary sequence database.
EMBL
EMBL
Signup and view all the flashcards
GenBank
GenBank
Signup and view all the flashcards
NCBI
NCBI
Signup and view all the flashcards
BLAST
BLAST
Signup and view all the flashcards
Entrez
Entrez
Signup and view all the flashcards
nucleotide blast
nucleotide blast
Signup and view all the flashcards
protein blast
protein blast
Signup and view all the flashcards
blastx
blastx
Signup and view all the flashcards
tblastn
tblastn
Signup and view all the flashcards
tblastx
tblastx
Signup and view all the flashcards
Sequence Search
Sequence Search
Signup and view all the flashcards
Multiple Sequence Alignment
Multiple Sequence Alignment
Signup and view all the flashcards
Clustal Omega
Clustal Omega
Signup and view all the flashcards
Alignment
Alignment
Signup and view all the flashcards
Algorithm
Algorithm
Signup and view all the flashcards
Conservation
Conservation
Signup and view all the flashcards
Domain
Domain
Signup and view all the flashcards
Identity
Identity
Signup and view all the flashcards
Study Notes
DNA and Protein Sequence Online Databases
- There are two types of sequence databases: primary and secondary databases
- Primary databases contain experimental results in an accessible format but are not consensus sequences
- Secondary databases are curated to reflect consensus sequences from multiple experiments and use primary databases as their sources
- Examples of primary databases: DDBJ (DNA Databank of Japan), EMBL (European Molecular Biology Laboratory), and GenBank
- Abbreviations: NCBI (National Center for Biotechnology Information), BLAST (Basic Local Alignment Search Tool)
Entrez and Sequence Search
- Entrez is a data retrieval system developed by NCBI that provides integrated access to a wide range of data domains
- Search goals:
- Identify a representative, well-annotated mRNA or protein sequence record
- Retrieve associated literature
- Identify conserved domains within the protein
- Identify similar proteins
- Find a resolved three-dimensional structure for the protein or identify structures with homologous sequence
- Steps to start: Go to the NCBI website, select a database (nucleotide or protein), and search for a gene or protein of interest
BLAST Introduction
- BLAST (Basic Local Alignment Search Tool) is a sequence comparison algorithm optimized for speed and sensitivity
- Selecting a BLAST program: nucleotide blast (blastn), protein blast (blastp), blastx, tblastn, and tblastx
- BLAST programs compare a query sequence against a database or vice versa
BLAST Results
- The blastp results page shows around 100 "Hits", or other protein sequences showing at least some similarity to the query sequence
- The illustration with red bars is a diagrammatic representation of how the query sequence lines up with other sequences in the database along the primary structure of the protein
- The two scores at the right (Ident and E value) indicate the degree of similarity
Clustal Omega and Multiple Sequence Alignment
- Clustal Omega is a DNA and protein multiple sequence alignment tool
- Multiple alignments of protein sequences are important tools in studying sequences and provide identification of conserved sequence regions
Glossary
- Alignment: The process of lining up two or more sequences to achieve maximal levels of identity and conservation
- Algorithm: A fixed procedure embodied in a computer program
- Bioinformatics: The merger of biotechnology and information technology with the goal of revealing new insights and principles in biology
- Conservation: Changes at a specific position of an amino acid or DNA sequence that preserve the physico-chemical properties of the original residue
- Domain: A discrete portion of a protein assumed to fold independently of the rest of the protein and possessing its own function
- Gap: A space introduced into an alignment to compensate for insertions and deletions in one sequence relative to another
- Global Alignment: The alignment of two nucleic acid or protein sequences over their entire length
- Identity: The extent to which two nucleotide or amino acid sequences are invariant
- Local Alignment: The alignment of some portion of two nucleic acid or protein sequences
- Motif: A short conserved region in a protein sequence
- Proteomics: The systematic analysis of protein expression in normal and diseased tissues that involves the separation, identification, and characterization of all of the proteins in an organism
- Query: The input sequence (or other type of search term) with which all of the entries in a database are to be compared
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.