Title: Porcine transcriptome analysis based on 97 non-normalized cDNA libraries and assembly of 1,021,891 expressed sequence tags
Authors: Gorodkin, Jan
Cirera, Susanna
Hedegaard, Jakob
Gilchrist, Michael J
Panitz, Frank
Jorgensen, Claus B
Scheibye-Knudsen, Karsten
Arvin, Troels
Lumholdt, Steen
Sawera, Milena
Green, Trine
Nielsen, Bente J
Havgaard, Jakob H
Rosenkilde, Carina
Wang, Jun
Li, Heng
Li, Ruiqiang
Liu, Bin
Hu, Songnian
Dong, Wei
Li, Wei
Yu, Jun
Wang, Jian
Staerfeltd, Hans-Henrik
Wernersson, Rasmus
Madsen, Lone B
Thomsen, Bo
Hornshoj, Henrik
Bujie, Zhan
Wang, Xuegang
Wang, Xuefei
Bolund, Lars
Brunak, Soren
Yang, Huanming
Bendixen, Christian
Fredholm, Merete
Issue Date: 2-Apr-2007
Abstract: Background Knowledge of the structure of gene expression is essential for mammalian transcriptomics research. We analyzed a collection of more than one million porcine expressed sequence tags (ESTs), of which two-thirds were generated in the Sino-Danish Pig Genome Project and one-third are from public databases. The Sino-Danish ESTs were generated from one normalized and 97 non-normalized cDNA libraries representing 35 different tissues and three developmental stages. Results Using the Distiller package, the ESTs were assembled to roughly 48,000 contigs and 73,000 singletons, of which approximately 25% have a high confidence match to UniProt. Approximately 6,000 new porcine gene clusters were identified. Expression analysis based on the non-normalized libraries resulted in the following findings. The distribution of cluster sizes is scaling invariant. Brain and testes are among the tissues with the greatest number of different expressed genes, whereas tissues with more specialized function, such as developing liver, have fewer expressed genes. There are at least 65 high confidence housekeeping gene candidates and 876 cDNA library-specific gene candidates. We identified differential expression of genes between different tissues, in particular brain/spinal cord, and found patterns of correlation between genes that share expression in pairs of libraries. Finally, there was remarkable agreement in expression between specialized tissues according to Gene Ontology categories. Conclusion This EST collection, the largest to date in pig, represents an essential resource for annotation, comparative genomics, assembly of the pig genome sequence, and further porcine transcription studies.
Description: RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.
URI: http://www.dspace.cam.ac.uk/handle/1810/238226
Other Identifiers: http://dx.doi.org/10.1186/gb-2007-8-4-r45
Appears in Collections:Caa-BioMed - No Cambridge University Affiliation

Files in This Item:

File Description SizeFormat
gb-2007-8-4-r45.xml164.98 kBXMLView/Open
gb-2007-8-4-r45.pdf537.33 kBAdobe PDFThumbnail
View/Open
GB-2007-8-4-R45-S3.PDF183.7 kBAdobe PDFThumbnail
View/Open
GB-2007-8-4-R45-S2.PDF202.92 kBAdobe PDFThumbnail
View/Open
GB-2007-8-4-R45-S4.PDF198.61 kBAdobe PDFThumbnail
View/Open
GB-2007-8-4-R45-S1.PDF93.37 kBAdobe PDFThumbnail
View/Open
Additional resources for this item
search for alternative versions in eresources@cambridge
retrieve citation metadata in EndNote format

This item has been accessed 258 times.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.