The UCSC Known Genes.

Bioinformatics 2006 May 24;22(9):1036-46. Epub 2006 Feb 24.

Center for Biomolecular Science and Engineering, School of Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA.

The University of California Santa Cruz (UCSC) Known Genes dataset is constructed by a fully automated process, based on protein data from Swiss-Prot/TrEMBL (UniProt) and the associated mRNA data from GenBank. The detailed steps of this process are described. Extensive cross-references from this dataset to other genomic and proteomic data were constructed. For each known gene, a details page is provided containing rich information about the gene, together with extensive links to other relevant genomic, proteomic and pathway data. As of July 2005, the UCSC Known Genes are available for human, mouse and rat genomes. The Known Genes dataset serves as a foundation to support several key programs: the Genome Browser, Proteome Browser, Gene Sorter and Table Browser offered at the UCSC website. All the associated data files and program source code are also available. They can be accessed at http://genome.ucsc.edu. The genomic coverage of UCSC Known Genes, RefSeq, Ensembl Genes, H-Invitational and CCDS is analyzed. Although UCSC Known Genes offers the highest genomic and CDS coverage among major human and mouse gene sets, more detailed analysis suggests all of them could be further improved.

DOI: http://dx.doi.org/10.1093/bioinformatics/btl048