Title: Some statistical properties of regulatory DNA sequences, and their use in predicting regulatory regions in the Drosophilagenome: the fluffy-tail test
Authors: Abnizova, Irina
te Boekhorst, Rene
Walter, Klaudia
Gilks, Walter R
Issue Date: 27-Apr-2005
Abstract: Abstract Background This paper addresses the problem of recognising DNA cis-regulatory modules which are located far from genes. Experimental procedures for this are slow and costly, and computational methods are hard, because they lack positional information. Results We present a novel statistical method, the "fluffy-tail test", to recognise regulatory DNA. We exploit one of the basic informational properties of regulatory DNA: abundance of over-represented transcription factor binding site (TFBS) motifs, although we do not look for specific TFBS motifs, per se . Though overrepresentation of TFBS motifs in regulatory DNA has been intensively exploited by many algorithms, it is still a difficult problem to distinguish regulatory from other genomic DNA. Conclusion We show that, in the data used, our method is able to distinguish cis-regulatory modules by exploiting statistical differences between the probability distributions of similar words in regulatory and other DNA. The potential application of our method includes annotation of new genomic sequences and motif discovery.
Description: RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.
URI: http://www.dspace.cam.ac.uk/handle/1810/237686
Other Identifiers: http://dx.doi.org/10.1186/1471-2105-6-109
Appears in Collections:Caa-BioMed - No Cambridge University Affiliation

Files in This Item:

File Description SizeFormat
1471-2105-6-109.xml71.25 kBXMLView/Open
1471-2105-6-109-S7.DOC29 kBMicrosoft WordView/Open
1471-2105-6-109-S8.DOC793 kBMicrosoft WordView/Open
1471-2105-6-109-S9.DOC792 kBMicrosoft WordView/Open
1471-2105-6-109-S1.DOC24.5 kBMicrosoft WordView/Open
1471-2105-6-109-S11.DOC930 kBMicrosoft WordView/Open
1471-2105-6-109-S6.DOC334 kBMicrosoft WordView/Open
1471-2105-6-109.pdf778.46 kBAdobe PDFThumbnail
View/Open
1471-2105-6-109-S3.DOC335 kBMicrosoft WordView/Open
1471-2105-6-109-S4.DOC23.5 kBMicrosoft WordView/Open
1471-2105-6-109-S5.DOC25 kBMicrosoft WordView/Open
1471-2105-6-109-S12.DOC1.01 MBMicrosoft WordView/Open
1471-2105-6-109-S2.DOC564 kBMicrosoft WordView/Open
1471-2105-6-109-S10.DOC758.5 kBMicrosoft WordView/Open
Additional resources for this item
search for alternative versions in eresources@cambridge
retrieve citation metadata in EndNote format

This item has been accessed 238 times.

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.