Title: Empirical Bayesian models for analysing molecular serotyping microarrays
Authors: Newton, Richard
Hinds, Jason
Wernisch, Lorenz
Issue Date: 31-Mar-2011
Citation: BMC Bioinformatics 2011, 12:88
Abstract: Abstract Background Microarrays offer great potential as a platform for molecular diagnostics, testing clinical samples for the presence of numerous biomarkers in highly multiplexed assays. In this study applied to infectious diseases, data from a microarray designed for molecular serotyping of Streptococcus pneumoniae was used, identifying the presence of any one of 91 known pneumococcal serotypes from DNA extracts. This microarray incorporated oligonucleotide probes for all known capsular polysaccharide synthesis genes and required a statistical analysis of the microarray intensity data to determine which serotype, or combination of serotypes, were present within a sample based on the combination of genes detected. Results We propose an empirical Bayesian model for calculating the probabilities of combinations of serotypes from the microarray data. The model takes into consideration the dependencies between serotypes, induced by genes they have in common, and by homologous genes which, although not identical, are similar to each other in sequence. For serotypes which are very similar in capsular gene composition, extra probes are included on the microarray, providing additional information which is integrated into the Bayesian model. For each serotype combination with high probability, a second model, a Bayesian random effects model is applied to determine the relative abundance of each serotype. Conclusions To assess the accuracy of the proposed analysis we applied our methods to experimental data from samples containing individual serotypes and samples containing combinations of serotypes with known levels of abundance. All but two of the known serotypes of S. pneumoniae that were tested as individual samples could be uniquely determined by the Bayesian model. The model also enabled the presence of combinations of serotypes within samples to be determined. Serotypes with very low abundance within a combination of serotypes can be detected (down to 2% abundance in this study). As well as detecting the presence of serotype combinations, an approximate measure of the percentage abundance of the serotypes within the combination can be obtained.
Description: RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.
URI: http://www.dspace.cam.ac.uk/handle/1810/237588
http://dx.doi.org/10.1186/1471-2105-12-88
Appears in Collections:Scholarly Works - Public Health

Files in This Item:

File Description SizeFormat
1471-2105-12-88.xml99.78 kBXMLView/Open
1471-2105-12-88-S2.PDF32.49 kBAdobe PDFThumbnail
View/Open
1471-2105-12-88.pdf470.95 kBAdobe PDFThumbnail
View/Open
1471-2105-12-88-S1.PDF30.82 kBAdobe PDFThumbnail
View/Open
Additional resources for this item
search for alternative versions in eresources@cambridge
retrieve citation metadata in EndNote format

This item has been accessed 179 times.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.