Long-range correlation properties of coding and noncoding DNA sequences: GenBank analysis

S. V. Buldyrev, A. L. Goldberger, S. Havlin, R. N. Mantegna, M. E. Matsa, C.-K. Peng, M. Simons, and H. E. Stanley
Phys. Rev. E 51, 5084 – Published 1 May 1995
PDFExport Citation


An open question in computational molecular biology is whether long-range correlations are present in both coding and noncoding DNA or only in the latter. To answer this question, we consider all 33 301 coding and all 29 453 noncoding eukaryotic sequences—each of length larger than 512 base pairs (bp—in the present release of the GenBank to determine whether there is any statistically significant distinction in their long-range correlation properties. Standard fast Fourier transform (FFT) analysis indicates that coding sequences have practically no correlations in the range from 10 bp to 100 bp (spectral exponent β=0.00±0.04, where the uncertainty is two standard deviations). In contrast, for noncoding sequences, the average value of the spectral exponent β is positive (0.16±0.05), which unambiguously shows the presence of long-range correlations. We also separately analyze the 874 coding and the 1157 noncoding sequences that have more than 4096 bp and find a larger region of power-law behavior. We calculate the probability that these two data sets (coding and noncoding) were drawn from the same distribution and we find that it is less than 1010. We obtain independent confirmation of these findings using the method of detrended fluctuation analysis (DFA), which is designed to treat sequences with statistical heterogeneity, such as DNA’s known mosaic structure (‘‘patchiness’’) arising from the nonstationarity of nucleotide concentration. The near-perfect agreement between the two independent analysis methods, FFT and DFA, increases the confidence in the reliability of our conclusion.

  • Received 5 January 1995


©1995 American Physical Society

Authors & Affiliations

S. V. Buldyrev, A. L. Goldberger, S. Havlin, R. N. Mantegna, M. E. Matsa, C.-K. Peng, M. Simons, and H. E. Stanley

  • Center for Polymer Studies and Department of Physics, Boston University, Boston, Massachusetts 02215
  • Cardiovascular Division, Harvard Medical School, Beth Israel Hospital, Boston, Massachusetts 02215
  • Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215
  • Department of Physics, Bar-Ilan University, Ramat Gan, Israel
  • Dipartimento di Energetica ed Applicazioni di Fisica, Palermo University, Palermo, I-90128, Italy

References (Subscription Required)

Click to Expand

Vol. 51, Iss. 5 — May 1995

Reuse & Permissions
Access Options
Physical Review E Scope Description to Include Biological Physics
January 14, 2016

The editors of Physical Review E are pleased to announce that the journal’s stated scope has been expanded to explicitly include the term “Biological Physics.”

Authorization Required




Sign up to receive regular email alerts from Physical Review E

Log In



Article Lookup

Paste a citation or DOI

Enter a citation