50 Questions
What is the primary function of DNA microarrays?
To monitor thousands of genes simultaneously
What is the advantage of using DNA microarrays over traditional methods?
They can analyze thousands of genes simultaneously
What is the purpose of the DNA sequences on a microarray?
To act as probes to identify genes that are transcribed
How are the DNA fragments on a microarray typically prepared?
They are amplified by PCR and then spotted onto the microarray
What is the purpose of a single slide on a DNA microarray?
To analyze thousands of genes simultaneously
What is the technology used to make DNA microarrays similar to?
An inkjet printer
What is an application of DNA microarrays?
To compare microarray data using cDNAs derived from RNA of different cell types
What is the result of using a DNA microarray as a hybridization tool?
The identification of genes that are transcribed
What is the primary goal of functional genomics in a given species?
To elucidate the roles of genetic sequences
What is the term used to describe the entire collection of proteins that an organism can make?
Proteome
What is the primary focus of epigenomics?
To study the chemicals that can tell the genome what to do
What is the term used to describe the large-scale study of small molecules within cells, biofluids, tissues or organisms?
Metabolomics
What is the primary focus of bioinformatics?
To extract information from genetic data using a mathematical approach
What is the term used to describe the analysis of the complete set of gene expression?
Transcriptomics
What is the term used to describe the sum of constituents within a cell at all levels?
Omics
What is the primary goal of proteomics?
To understand the interplay among many different proteins
What is the primary purpose of functional microarrays?
To probe the function of proteins
What is the term for the marriage between genetics and biocomputing?
Bioinformatics
What are the three basic components of computer analysis of genetic sequences?
Computer, data, and program
What is the primary function of a computer program in bioinformatics?
To analyze data in a desired way
What is the purpose of generating a computer data file?
To collect genetic data in a form suitable for storage and manipulation by a computer
How are genetic sequences typically entered into a computer file?
Manually and by instruments
What is one of the ways genetic sequences can be analyzed?
To identify functional sequences
What is the term for the study of the evolutionary relationship between genetic sequences?
Phylogenetics
What is the primary goal of gene prediction?
To identify regions of genomic DNA that encode genes
What strategy do computer programs use to locate genes by searching for specific signals?
Searching for an organization of known sequence elements
What is the primary difference between search by signal and search by content strategies?
Search by signal searches for known sequence elements, while search by content searches for sequences that differ significantly from a random distribution
What is an open reading frame (ORF)?
A nucleotide sequence that does not contain any stop codons
Why are there six possible reading frames in a newly discovered sequence?
Because there are three reading frames in each direction (5' to 3' and 3' to 5')
What is the significance of homologous genes?
They are derived from the same ancestral gene
What is the purpose of translating a genomic DNA sequence in all three reading frames?
To locate coding regions within a DNA sequence
Why are long ORFs contained within chromosomal gene sequences in prokaryotes?
Because introns are not present in prokaryotes
What is the primary goal of using a tool like BLAST?
To identify the function of genetic sequences
What does a small E-value in BLAST indicate?
That the similarity is unlikely to be due to random events
Why is homology among protein sequences easier to identify than is DNA sequence homology?
Because the genetic code is degenerate
What is the purpose of the E-value in BLAST?
To represent the number of times a match would be expected to occur by random chance
What is the result of using a query sequence in BLAST?
A list of homologous sequences with corresponding E-values
Why are the results of a BLAST search typically sorted by E-value?
Because the E-value determines the significance of the match
What is the advantage of using BLAST over other sequence comparison methods?
BLAST can search large databases quickly
What is the primary input for a BLAST search?
A query sequence
What is the purpose of multiple sequence alignment?
To identify conserved sites among homologs
What is the characteristic of conserved sites in homologs?
They are identical or similar across multiple homologs
How many paralogs of the globin gene family are functionally expressed in humans?
9
What is the composition of hemoglobin protein?
Two α-chains and two β-chains
During what stage of development are the ζ and ε genes expressed?
Early embryonic development
What can be inferred from the high degree of sequence similarity between species?
The function of the sequence is similar
Who originally proposed the multiple sequence alignment approach?
Saul Needleman and Christian Wunsch
What is the result of comparing the sequences of the hemoglobin chains?
Insight into their structure and function
What is the category of the β gene in the globin gene family?
β-chains
What is the purpose of multiple sequence alignment in the globin gene family?
To compare the sequences of the hemoglobin chains
Study Notes
Omics
- In biology, the word "omics" refers to the sum of constituents within a cell at all levels.
- There are several types of omics, including:
- Epigenomics: deals with chemicals that can tell the genome what to do
- Proteomics: studies the entire collection of proteins that an organism can make
- Bioinformatics: analyzes biological information using a mathematical/computational approach
- Transcriptomics: analyzes the complete set of gene expression
- Metabolomics: is the large-scale study of small molecules, commonly known as metabolites, within cells, biofluids, tissues, or organisms
Functional Genomics
- The goal of functional genomics is to elucidate the roles of genetic sequences in a given species.
- It aims to understand gene function and the interplay among many different genes.
- Recent genome-sequencing projects have enabled the study of gene function at a more complex level.
- We can now examine groups of many genes simultaneously.
DNA Microarrays
- DNA microarrays (also called gene chips) are a technology that makes it possible to monitor thousands of genes simultaneously.
- A DNA microarray is a small silica, glass, or plastic slide that is dotted with many sequences of DNA.
- Each of these sequences corresponds to a known gene.
- The DNA fragments on a microarray can be either amplified by PCR and then spotted onto the microarray or synthesized directly on the microarray itself.
- A single slide contains tens of thousands of different spots in an area the size of a postage stamp.
- The technology for making DNA microarrays is quite amazing, involving spotting technologies that are quite similar to the way that an inkjet printer works.
Applications of DNA Microarrays
- DNA microarrays can be used to:
- Compare microarray data using cDNAs derived from RNA of different cell types to identify genes that are expressed in a cell-specific manner.
- Identify genes that are expressed in a cell-specific manner.
- Probe the function of proteins.
Bioinformatics
- The computer has become an important tool in genetic studies.
- The marriage between genetics and biocomputing has yielded an important branch of science: bioinformatics.
- Computer analysis of genetic sequences usually relies on three basic components: a computer, a computer program, and some type of data.
- A computer program is a defined series of operations that can analyze data in a desired way.
- A first step in the computer analysis of genetic data is the generation of a computer data file.
- The genetic sequence can be analyzed in many ways, including:
- Does a sequence contain a gene?
- Where are functional sequences, such as promoters and splice sites?
- Does a sequence encode a polypeptide?
- Is a sequence homologous to other sequences?
- What is the evolutionary relationship between two or more genetic sequences?
Approaches to Identify Genes
- Gene prediction refers to the process of identifying regions of genomic DNA that encode genes.
- Computer programs can employ different strategies to locate genes, including:
- Search by signal: looking for an organization of known sequence elements that are normally found within a gene.
- Search by content: looking for sequences that differ significantly from a random distribution due to codon bias within protein-encoding genes.
Open Reading Frames
- Another way to locate coding regions within a DNA sequence is to examine reading frames.
- In a DNA sequence, the reading of codons could begin with the first, second, or third nucleotide.
- These are called reading frame 1, 2, and 3, respectively.
- An open reading frame (ORF) is a nucleotide sequence that does not contain any stop codons.
- In prokaryotes, long ORFs are contained within the chromosomal gene sequences.
- In eukaryotes, however, the chromosomal coding sequences may be interrupted by introns.
Homologous Genes
- When comparing genetic sequences, researchers sometimes find two or more similar sequences.
- Homology between genetic sequences can be identified by computer programs and databases.
- A strong correlation is typically found between homology and function.
- Homologous genes are derived from the same ancestral gene.
BLAST
- BLAST (Basic Local Alignment Search Tool) is a program that starts with a genetic sequence and then locates homologous sequences in a large database.
- Homology among protein sequences is easier to identify than is DNA sequence homology.
- The relationship between the query sequence and each matching sequence is given an E-value (Expect value).
- The E-value represents the number of times that the match or a better one would be expected to occur purely by random chance in the entire database.
Proteomics
- Proteomics is the study of the functional roles of proteins in a species.
- The entire collection of proteins in a species is known as the proteome.
- Genomic data can provide insights into the proteome, but may not accurately measure protein abundance.
The Proteome is Larger than the Genome
- The proteome is larger than the genome due to cellular processes such as:
- Alternative splicing
- RNA editing
- Post-translational covalent modification
Alterations that Affect the Proteome
- Alternative splicing:
- Occurs in eukaryotes
- A single pre-mRNA is spliced into multiple versions
- Splicing is often cell-specific or related to environmental conditions
- RNA editing:
- Less common than alternative splicing
- Leads to changes in the coding sequence of mRNA after it is made
DNA Microarrays
- A DNA microarray is a technology that allows for the simultaneous monitoring of thousands of genes.
- It consists of a small silica, glass, or plastic slide dotted with many DNA sequences, each corresponding to a known gene.
- The DNA fragments on the microarray can be:
- Amplified by PCR and then spotted onto the microarray
- Synthesized directly on the microarray
- A single slide can contain tens of thousands of different spots, with the relative location of each spot known.
Applications of DNA Microarrays
- Cell-specific gene expression: Comparing microarray data using cDNAs derived from RNA of different cell types can identify genes that are expressed in a cell-specific manner.
- Functional microarrays: Consist of many different cellular proteins, used to probe the function of proteins.
Bioinformatics
- The computer has become an important tool in genetic studies, combining genetics and biocomputing to form the field of bioinformatics.
- Computer analysis of genetic sequences relies on three basic components:
- A computer
- A computer program
- Some type of data
Sequence Files and Computer Programs
- A computer program is a defined series of operations that can analyze data in a desired way.
- A computer data file is a collection of information in a form suitable for storage and manipulation by a computer.
- Entering data into a computer file can be done manually or by instruments that read data directly from a sequencing ladder.
Analyzing Genetic Sequences
- Computer analysis of genetic sequences can answer questions such as:
- Does a sequence contain a gene?
- Where are functional sequences, such as promoters and splice sites?
- Does a sequence encode a polypeptide?
- Is a sequence homologous to other sequences?
- What is the evolutionary relationship between two or more genetic sequences?
Homologous Genes
- Homologous genes are genes that have evolved from a common ancestral gene.
- Orthologs: Homologous genes found in different species.
- Paralogs: Homologous genes found in a single organism.
- A gene family consists of two or more copies of homologous genes within the genome of a single organism.
Computer Databases
- Computer databases store large amounts of genetic information generated by researchers.
- Databases contain annotations, including the genetic sequence and a concise description of it, as well as other features of significance.
- Examples of major computer databases include GenBank, EMBL, and DDBJ.
Multiple Sequence Alignment
- This approach compares homologous genes to identify conserved sites and understand their structure and function.
- It was originally proposed by Saul Needleman and Christian Wunsch in 1970.
- The globin gene family in humans is an example of multiple sequence alignment.
Globin Gene Family
- The globin gene family consists of 9 paralogs that are functionally expressed in humans.
- The 9 paralogs fall into two categories: α-chains and β-chains.
- The composition of hemoglobin changes during development, with different genes expressed during early embryonic development and in the adult.
- Comparing the sequences of the hemoglobin chains can give insight into their structure and function.
Test your understanding of functional genomics, proteomics, and bioinformatics with this quiz. Covers key concepts from Chapter 23 of Genetics: Analysis & Principles by Robert J. Brooker.
Make Your Own Quizzes and Flashcards
Convert your notes into interactive study material.
Get started for free