Aviso: para depositar documentos, por favor, inicia sesión e identifícate con tu cuenta de correo institucional de la UCM con el botón MI CUENTA UCM. No emplees la opción AUTENTICACIÓN CON CONTRASEÑA
 

Testing the order of Markov dependence in DNA sequences

dc.contributor.authorPardo Llorente, Leandro
dc.contributor.authorMenéndez Calleja, María Luisa
dc.contributor.authorPardo Llorente, María del Carmen
dc.contributor.authorZografos, Konstantinos
dc.date.accessioned2023-06-20T00:20:22Z
dc.date.available2023-06-20T00:20:22Z
dc.date.issued2011-03
dc.description.abstractDNA or protein sequences are usually modeled as probabilistic phenomena. The simplest model is created on the assumption that the nucleotides at the various sites are independently distributed. Usually the type of nucleotide at some site depends on the type at another site and therefore the DNA sequence is modeled as a Markov chain of random variables taking on the values A, G, C and T corresponding to the four nucleotides. First order or higher order Markov models provide better fit to a DNA sequence. Based on this remark, the aim of this paper is to present and study a family of test statistics for testing order Markov dependence in DNA sequences. This new family includes as a particular case the classical likelihood ratio test. A simulation study is presented in order to find test statistics, in this family, with a better behaviour than the likelihood ratio test.
dc.description.departmentDepto. de Estadística e Investigación Operativa
dc.description.facultyFac. de Ciencias Matemáticas
dc.description.refereedTRUE
dc.description.statuspub
dc.eprint.idhttps://eprints.ucm.es/id/eprint/17330
dc.identifier.doi10.1007/s11009-008-9107-1
dc.identifier.issn1387-5841
dc.identifier.officialurlhttp://www.springerlink.com/content/q2j600m165xg7r2n/fulltext.pdf
dc.identifier.relatedurlhttp://link.springer.com/
dc.identifier.urihttps://hdl.handle.net/20.500.14352/42420
dc.issue.number1
dc.journal.titleMethodology and computing in applied probability
dc.language.isoeng
dc.page.final74
dc.page.initial59
dc.publisherSpringer
dc.relation.projectIDMTM 2006-06872
dc.relation.projectIDHG2004-0012
dc.rights.accessRightsrestricted access
dc.subject.cdu61
dc.subject.cdu57
dc.subject.keywordDNA sequence
dc.subject.keywordMarkov dependence
dc.subject.keywordLikelihood ratio test
dc.subject.keywordPhi-divergence test statistics
dc.subject.keywordDivergence
dc.subject.keywordChain
dc.subject.ucmEstadística aplicada
dc.titleTesting the order of Markov dependence in DNA sequences
dc.typejournal article
dc.volume.number13
dcterms.referencesAvery PJ, Henderson DA (1999) Fitting Markov chain models to discrete state series such as DNA sequences. Appl Stat 48:53–61 Bejerano G, Friedman N, Tishhy N (2004) Efficient exact p-value computation for small sample, sparse and surprising categorical data. J Comput Biol 11:867–886 Bell GI, Sánchez-Pescador R, Laybourn PJ, Najarian RC (1983) Exon duplication and divergence in the human preproglucagon gene. Nature 304:368–371 Billingsley P (1961a) Statistical methods in Markov chains. Ann Math Stat 32:13–39 Billingsley P (1961b) Statistical inference for Markov processes. The University of Chicago Press, Chicago Ewens WJ, Grant GR (2005) Statistical methods in bioinformatics (2nd edn). Springer, New York. Hoel PG (1954) A test for Markov chains. Biometrika 14:430–433 Menéndez ML, Pardo JA, Pardo L (2001) Csiszar’s ϕ-divergences for testing the order in a Markov chain. Stat Pap 42:313–328 Menéndez ML, Pardo JA, Pardo L, Zografos K (2006) On tests of independence based on minimum φ-divergence estimator with constraints: an application to modeling DNA. Comput Stat Data Anal 51(2):1100–1118 Patel NR (2003) An exact test for homogeneity of a Markov chain. www.cytel.com Pardo L (2006) Statistical inference based on divergence measures. Chapman & Hall/CRC, New York Pardo L,Morales D, Salicrú M, MenéndezML (1993) The ϕ-divergence statistic in bivariate multinomial populations including stratification. Metrika 40:223–235 Read TRC, Cressie NAC (1988) Goodness-of-fit statistics for discrete multivariate data. Springer, New York Reinert G, Schbath S, Waterman MS (2000) Probabilistic and statistical properties of words: and overview. J Comput Biol 7:1–46 Zografos K (1993) Asymptotic properties of φ-divergence statistic and applications in contingency tables. Int J Math Stat Sci 2:5–21
dspace.entity.typePublication
relation.isAuthorOfPublicationa6409cba-03ce-4c3b-af08-e673b7b2bf58
relation.isAuthorOfPublication.latestForDiscoverya6409cba-03ce-4c3b-af08-e673b7b2bf58

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
PardoLeandro01.pdf
Size:
343.37 KB
Format:
Adobe Portable Document Format

Collections