Prediction of promoters and enhancers using multiple DNA methylation-associated features

Show simple item record

dc.contributor.author Hwang, W en
dc.contributor.author Oliver, Verity en
dc.contributor.author Merbs, SL en
dc.contributor.author Zhu, H en
dc.contributor.author Qian, J en
dc.date.accessioned 2015-09-04T05:01:00Z en
dc.date.issued 2015 en
dc.identifier.citation BMC Genomics, 2015, 16 (Suppl 7), pp. S11 - S11 en
dc.identifier.issn 1471-2164 en
dc.identifier.uri http://hdl.handle.net/2292/26883 en
dc.description.abstract Background Regulatory regions (e.g. promoters and enhancers) play an essential role in human development and disease. Many computational approaches have been developed to predict the regulatory regions using various genomic features such as sequence motifs and evolutionary conservation. However, these DNA sequence-based approaches do not reflect the tissue-specific nature of the regulatory regions. In this work, we propose to predict regulatory regions using multiple features derived from DNA methylation profile. Results We discovered several interesting features of the methylated CpG (mCpG) sites within regulatory regions. First, a hypomethylation status of CpGs within regulatory regions, compared to the genomic background methylation level, extended out >1000 bp from the center of the regulatory regions, demonstrating a high degree of correlation between the methylation statuses of neighboring mCpG sites. Second, when a regulatory region was inactive, as determined by histone mark differences between cell lines, methylation level of the mCpG site increased from a hypomethylated state to a hypermethylated state, the level of which was even higher than the genomic background. Third, a distinct set of sequence motifs was overrepresented surrounding mCpG sites within regulatory regions. Using 5 types of features derived from DNA methylation profiles, we were able to predict promoters and enhancers using machine-learning approach (support vector machine). The performances for prediction of promoters and enhancers are quite well, showing an area under the ROC curve (AUC) of 0.992 and 0.817, respectively, which is better than that simply based on methylation level, especially for prediction of enhancers. Conclusions Our study suggests that DNA methylation features of mCpG sites can be used to predict regulatory regions. en
dc.relation.ispartofseries BMC Genomics en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. Details obtained from http://www.sherpa.ac.uk/romeo/issn/1064-3745/ en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by/4.0/ en
dc.title Prediction of promoters and enhancers using multiple DNA methylation-associated features en
dc.type Journal Article en
dc.identifier.doi 10.1186/1471-2164-16-S7-S11 en
pubs.issue Suppl 7 en
pubs.begin-page S11 en
pubs.volume 16 en
dc.description.version VoR - Version of Record en
dc.identifier.pmid 26099324 en
pubs.end-page S11 en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en
pubs.subtype Article en
pubs.elements-id 489080 en
pubs.record-created-at-source-date 2015-06-29 en
pubs.dimensions-id 26099324 en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics