Large deviation principles for words drawn from correlated letter sequences

F. Hollander, den, J. Poisat

Samenvatting

When an i.i.d. sequence of letters is cut into words according to i.i.d. renewal times, an i.i.d. sequence of words is obtained. In the annealed LDP (large deviation principle) for the empirical process of words, the rate function is the speci¿c relative entropy of the observed law of words w.r.t. the reference law of words. In Birkner, Greven and den Hollander [3] the quenched LDP (= conditional on a typical letter sequence) was derived for the case where the renewal times have an algebraic tail. The rate function turned out to be a sum of two terms, one being the annealed rate function, the other being proportional to the speci¿c relative entropy of the observed law of letters w.r.t. the reference law of letters, obtained by concatenating the words and randomising the location of the origin. The proportionality constant equals the tail exponent of the renewal process. The purpose of the present paper is to extend both LDP’s to letter sequences that are not i.i.d. It is shown that both LDP’s carry over when the letter sequence satis¿es a mixing condition called summable variation. The rate functions are again given by speci¿c relative entropies w.r.t. the reference law of words, respectively, letters. But since neither of these reference laws is i.i.d., several approximation arguments are needed to obtain the extension.
Originele taal-2 Engels Eindhoven Eurandom 14 Gepubliceerd - 2013

Publicatie series

Naam Report Eurandom 2013007 1389-2355

Vingerafdruk

Duik in de onderzoeksthema's van 'Large deviation principles for words drawn from correlated letter sequences'. Samen vormen ze een unieke vingerafdruk.