Large-Scale Statistical Analyses of Rice ESTs Reveal Correlated Patterns of Gene Expression
AUTOR(ES)
Ewing, Rob M.
FONTE
Cold Spring Harbor Laboratory Press
RESUMO
Large, publicly available collections of expressed sequence tags (ESTs) have been generated from Arabidopsis thaliana and rice (Oryza sativa). A potential, but relatively unexplored application of this data is in the study of plant gene expression. Other EST data, mainly from human and mouse, have been successfully used to point out genes exhibiting tissue- or disease-specific expression, as well as for identification of alternative transcripts. In this report, we go a step further in showing that computer analyses of plant EST data can be used to generate evidence of correlated expression patterns of genes across various tissues. Furthermore, tissue types and organs can be classified with respect to one another on the basis of their global gene expression patterns. As in previous studies, expression profiles are first estimated from EST counts. By clustering gene expression profiles or whole cDNA library profiles, we show that genes with similar functions, or cDNA libraries expected to share patterns of gene expression, are grouped together. Promising uses of this technique include functional genomics, in which evidence of correlated expression might complement (or substitute for) those of sequence similarity in the annotation of anonymous genes and identification of surrogate markers. The analysis presented here combines the application of a correlation-based clustering method with a graphical color map allowing intuitive visualization of patterns within a large table of expression measurements.
ACESSO AO ARTIGO
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=310820Documentos Relacionados
- The analysis of large-scale gene expression correlated to the phase changes of the migratory locust
- GECKO: a complete large-scale gene expression analysis platform
- Large-scale temporal gene expression mapping of central nervous system development
- Large-scale gene-expression studies and the challenge of multiple sclerosis
- Subsystem Identification Through Dimensionality Reduction of Large-Scale Gene Expression Data