..

బయోమెట్రిక్స్ & బయోస్టాటిస్టిక్స్ జర్నల్

మాన్యుస్క్రిప్ట్ సమర్పించండి arrow_forward arrow_forward ..

Locating CpG Islands with Kullback-Leibler Divergence

Abstract

Yung-Pin Chen, Andrew Dittmore, Yasuhiro Goda, Alicia Laughton and Jessica Minnier

A CpG island is a short contiguous DNA subsequence that is rich in CG dinucleotides. CpG islands are often located around the promoters of housekeeping genes and have been found associated with certain tissue-specific genes. This observation indicates that they can be used as markers to identify genes. The information about the locations of CpG islands can also help us understand a gene regulation process called methylation. In this report, we propose a statistical method for locating CpG islands. Our method employs the Kullback-Leibler divergence. We use the given DNA sequence to determine a window size and a shift size for computing the divergence values along a DNA segment. A region in the proximity of a CpG island should contain consecutive windows with high divergence values. The distribution of the Kullback-Leibler divergence values can be suitably fitted by a truncated Pareto distribution. We estimate the parameters of the truncated Pareto distribution via the maximum likelihood principle. Then the fitted distribution is applied to locate regions with a divergence value exceeding a threshold level of significance. To assess the accuracy of our method, we compare our results to the putative CpG islands found in four well-studied mouse and human DNA sequences. The comparison suggests our approach consistently yields reliable predictions of CpG island locations.

నిరాకరణ: ఈ సారాంశం ఆర్టిఫిషియల్ ఇంటెలిజెన్స్ టూల్స్ ఉపయోగించి అనువదించబడింది మరియు ఇంకా సమీక్షించబడలేదు లేదా నిర్ధారించబడలేదు

ఈ కథనాన్ని భాగస్వామ్యం చేయండి

ఇండెక్స్ చేయబడింది

arrow_upward arrow_upward