Detection of embeddings in binary Markov chains

Yuriy S. Kharin; Egor V. Vecherko

doi:10.1515/dma-2016-0002

Article Publicly Available

Detection of embeddings in binary Markov chains

Yuriy S. Kharin and Egor V. Vecherko

Published/Copyright: April 30, 2016

Published by

Become an author with De Gruyter Brill

Author Information Explore this Subject

From the journal Discrete Mathematics and Applications Volume 26 Issue 1

Abstract

The paper is concerned with problems in steganography on the detection of embeddings and statistical estimation of positions at which message bits are embedded. Binary stationary Markov chains with known or unknown matrices of transition probabilities are used as mathematical models of cover sequences (container hles). Based on the runs statistics and the likelihood ratio statistic, statistical tests are constructed for detecting the presence of embeddings. For a family of contiguous alternatives, the asymptotic power of statistical tests based on the runs statistics is found. An algorithm of polynomial complexity is developed for the statistical estimation of positions with embedded bits. Results of computer experiments are presented.

Keywords: steganography; model of embeddings; Markov chain; statistical test; power; total number of runs

1 Introduction

The paper is concerned with a topical problem in steganographic information security—this is the problem of embedding detection, that is of the construction of statistical tests for the existence of embeddings and of statistical estimates of positions (points) of embeddings.

The problem of detection of embeddings was studied in [1, 2, 3, 4] under the assumption that the probabilistic model of a cover sequence is completely known. So, in [1] statistical tests were constructed for the embedding existence in the case when the initial (cover) sequence is modeled by a Bernoulli scheme of independent trials; it was also shown that the embedding detection is impossible if the fraction of the embeddings tends to 0 as the length of the initial sequence tends to to. A similar fact was proved in [3]. In [2] a most powerful statistical test for the embedding existence was constructed for the model based on a Bernoulli scheme of independent trials, and statistical estimates of the fraction of embeddings were put forward. Statistical estimates of the model parameters of the embedding in a binary Markov chain were constructed and examined in [5]; they allow to make preliminary conclusions on the fraction of embeddings. It is worth mentioning that the majority of studies on the detection of embeddings are based on empirical characteristics of sequences, which involve methods of discriminant analysis for testing the embedding existence. We also note that the above problems of recognition of embeddings are close to those on the detection of deviations of output sequences of cryptographic generators from uniformly distributed random sequences [6].

Our purpose in this paper is to continue the studies initiated in [5]: we construct and analyse statistical tests for the embedding existence, and to develop algorithms for the statistical estimation of embeddings points.

The paper is organized as follows. In § 2 we describe the mathematical (q, r)-block model of embedding in a binary Markov chain. In §3 we construct statistical tests for the embedding existence based on the runs statistics and on the short runs statistics, and in § 4 we consider tests based on the likelihood ratio statistics. In §5, we put forward an algorithm of polynomial complexity for statistical estimation. The results of numerical experiments are given in § 6.

2 Mathematical model of embedding

We define the generalized (q, r)-block model of embedding, a particular case of which was proposed by the authors of the present paper in [5]. Throughout, (Ώ, F, P) is the underlying probability space, V = {0,1} is the binary alphabet, V_T is the space of binary T-dimensional vectors, O (■) is the ‘big O’ notation introduced by Landau, N is the set of natural numbers, I{A} is the indicator of an event A, u^t₂_t₁ =(u_t₁,..., u_t₂ ) ∈ V_{t₂-t_l+1}(t₁, t₂ ∈ ℕ, t₁ < t₂) is a binary string of t₂ -t₁ + 1 successive symbols of some sequence {u_t : t ∈ ℕ}, w(•) is the Hamming weight, 𝔏{ξ} is the probability distribution of a random variable ξ, Β(θ) denotes the Bernoulli distribution with parameter θ ∈ [0, 1]: Ρ{ξ = 1} = 1 - Ρ{ξ = 0} = θ, Φ(•) is the distribution function for the standard normal law 𝒩(0, 1).

According to [5], an adequate model of the cover sequence for embedding a message is a binary sequence x^T₁= (x₁, x₂, ..., x_T) ∈ V_T, x_t ∈ V, t = 1, ..., T, of length T, which is a homogeneous hrst-order binary Markov chain with symmetric matrix of one-step transition probabilities P:

(1)P=P(ε)=12(1+ε1−ε1−ε1+ε), P{x1⊕xt+1}=12(1−ε), |ε|<1

Here, ε is the parameter of the model: the case ε = 0 corresponds to a scheme of independent trials which was examined in [1]. The case ε > 0 takes into account an attraction-type dependence, and the ε < 0, a repulsiontype dependence. We note that the Markov chain (1) satishes the ergodicity conditions [7] and has the uniform stationary distribution (1/2, 1/2). In what follows, we shall assume that the Markov chain (1) is stationary, and so its initial probability distribution agrees with the uniform distribution.

In practical applications [5] a message is subject to a cryptographic transformation before being embedded in the cover sequence, and hence we assume in what follows that a message ξ₁^M = (ξ₁, ..., ξ_M) ∈ V_M, M < T, is a sequence of M independent Bernoulli random variables:

(2)𝔏{ξt}=𝔅(θ1), P{ξt=j}=θj, j∈V, θ1=1−θ0, t=1, ..., M.

The stego-key γ₁^T = (γ₁ ..., γ_T) ∈ V_T specihes the points (time instants) at which the message bits ξ^Μ₁ are embedded in the sequence x₁^T. We introduce a special (q, r)-block model of the stego-key γ^T₁(q, r ∈ ℕ, r < q), assuming that the length of the sequence x^T₁ is a multiple of q: T = K_q.

Let ζ_k ∈ V, 𝔏{ζ_k} = 𝔅(δ), k = 1, ..., K, be auxiliary independent random variables, which govern the choice of the blocks {x_(k) = x^kq_(k-1)q+1} for embedding the message ξ^M₁: if ζ_k = 1, then r successive bits of the message are embedded in r randomly chosen bits of the block x_(k) if ζ_k = 0 then no embedding in the block x_(k) is performed; G^{(q, r)} = {g^{(q, r)}₁, ..., g^{(q, r)}_C^r_q} = {u^q₁ ∈ V_q : w(u^q₁) = r} is the set consisting of

C_q^r lexicographically ordered binary vectors of length q equipped with the Hamming weight r; g₁, g₂, ... are independent random variables, g_k has uniform probability distribution on the set {1, ..., C^r_q},

P{γ(k)=gi(q, r)|ζk=1}=P{gk=i}=1Cqr.

In the (q, r)-block model of embedding, the sequence γ^T₁ consists of blocks of length q: γ₍₁₎ = γ^q₁, γ₍₂₎ = γ^2q_{q + 1}, . . ., γ_(K) = γ^Kq_{(K - 1)q + 1},

(3)γk=(0, ...., 0), ⏟qζk=0, gi(q, r)∈G(q, r), ζk=1, gk=i, k=1, ..., T/q,

the parameter δ characterizes the fraction of embeddings. We note that for the (q, r)-block model of embedding the maximum capacity for the stego system is Tr/q bits, while the cardinality of the set of all possible stego-keys

Γq, r={G(q, r)∪{(0, ....0)}}T/q

is |Γ^{(q, r)}| = (1 + C^r_q)^T/q. In the case q = r = 1, we have the classical model [5] of a bit-wise embedding, |Γ^{(1, 1)}| = 2^T.

For the most commonly encountered in steganography methods of embedding (the ‘LSB replacement’ and the ‘± embedding’ [8]) the random stego-sequence Y^T₁ = (Y₁, ..., Y_T) is generated by the sequences {x_t}, {ξ_t}, {γ_t} via the function transform

(4)Yt=xt⊕γtxt⊕γtξTt={xt, γt=0, ξTt, γt=1,

where τ_t = ∑^t_{j = 1} γ_j. The sequences {x_t}, {ξ_t}, {γ_t} are assumed to be jointly independent.

We note that for r = 1 the model presented here coincides with the q-block model considered in [5].

From the practical point of view, the case with θ₀ = θ₁ = 1/2 in (2), which presents the greatest challenge for embedding detection, is the most noteworthy in the framework of the Markov model of embedding (1)-(4). In this case the one-dimensional distribution of probabilities is not distorted for an embedding in (4),

(5)P{Yt=1}=P{Yt=0}=P{xt=1}=P{xt=0}=1/2, t=1, 2, ....., T.

Another justihcation of the relevance of the case considered in the present paper is the practical utilization of preliminary cryptographic transformation of a message that removes the nonuniformity in the probability distribution of symbols.

3 Embedding detection based on the runs statistics

3.1 Using the total number of runs statistics

We introduce two hypotheses concerning the fraction δ ∈ [0, 1] of embeddings:

(6)H0:{δ=0}, H1:{δ>0}.

The hypothesis H₀ means that there are no embeddings and the stego-sequence Y^T₁ agrees with the cover sequence x^T₁. The composite alternative H₁ means there exist embeddings with some unknown fraction δ > 0. If the parameter of the cover sequence ε is known, then the null hypothesis, which will be denoted by H_{0, ε}, is simple; otherwise, H₀ is also a composite hypothesis. If the hypothesis H₀ holds, then the probability measure P will be denoted by P₀, otherwise, by P_δ. One similarly denotes the moments of random variables. The distributions P₀ and P_δ were found in [5].

Lemma 1. Under the hypothesisH_{0, ε}, the probability distribution of the stego-sequenceY^T₁ is as follows

P0{Y1T=y1T}=P0{x1T=y1T}=2−T(1−ε)BT−1(1+ε)T−BT,

where

BT=BT(y1T)=1+∑t=1T−1yt⊕yt+1

is the minimal sufficient statistics withH_{0, ε}.

The statistics B_T is called the ‘runs test’ in [9] (it means the total number of runs). By virtue of (1), under the hypothesis H₀ the sequence of indicators I{Y_t ⊗ Y_t+1 = 1} consists of independent random variables with Bernoulli distribution 𝔅(2^-1(1 - ε)). Using the exact binomial probability distribution of the statistics B_T with the known value of ε, one may construct a randomized statistical test for the embedding existence with the given probability of the hrst kind error α, . However, for practical purposes, it is more convenient to use its asymptotic variant as T → ∞, which is given by the critical region

(7)X1αB+={y1T:BT≥1+12T(1−ε)−12tαT(1−ε2)}forε>0,X1αB−={y1T:BT≤1+12T(1−ε)+12tαT(1−ε2)}forε<0,

where t_α is the α-quantile of the standard normal distribution: Φ(t_α) = α.

Theorem 1. Let the model of embedding (4) hold. Then asT → ∞ the asymptotic size of test (7) for the hypothesesH_{0, ε}, H₁based on the total number of runs statisticsB_Tcoincides with a preassigned significance levelα ∈ (0, 1). The asymptotic expression for the power of this test in the case of the(1, 1)-model of embedding and of the family of simple contiguous alternativesH_{1, δ} : {δ = ρ/T^β}, β > 0, is as follows:

(8)W1B+=W1B−→1, 0<β<1/2, Φtα+2ρε1−ε2, β=1/2, α, β>1/2

Proof. Under the hypothesis H₀, the De Moivre-Laplace limit theorem implies that

(9)L0BT−1−12T(1−ε)12T(1−ε2)→N(0, 1)asT→∞

Hence, using (9) we have, as T → ∞,

P0{X1αB+}→α, P0{X1αB−}→α.

In the case q = r = 1, under the alternative H₁, it follows from (1), (2), (4), (5) that the initial hrst moment of the random variable B_T is equal to

Eδ{BT}=1+∑t=1T−1Eδ{Yt⊕Yt+1}=1+2−1(T−1)(1−(1−δ)2ε)

Using similar arguments, we calculate the initial second moment under the alternative H

Eδ{Bt2}=Eδ(1+∑t1=1T−1Yt1⊕Yt1+1)(1+∑t2=1T−1Yt2⊕Yt2+1==Eδ∑t1, t2=1T−1(Yt1⊕Yt1+1)(Yt2⊕Yt2+1)+2Eδ{BT}−1==3Eδ{BT}−2+2∑t=1T−2∑h∈VPδ{Yt=h, Yt+1=1−h, Yt+2=h}++2∑τ=2T−2∑t=1T−τ−1∑h1, h2∈VPδ{Yt=h1, Yt+1=1−h1, Yt+τ=h2, Yt+τ=1−h2}==3Eδ{BT}−2+2−1(T−2)(1+ε(ε−2)(1−δ)2)++4∑τ=2T−2(T−τ−1)(Pδ{Yt=0, Yt+1=1, Yt+τ=0, Yt+τ+1=1}+Pδ{Yt=0, Yt+1=1, Yt+τ=1, Yt+τ+1=0})==1+2−13(T−1)(1−ε(1−δ)2)+2−1(T−2)(1−ε(ε−2)(1−δ)2)++2−2(T−2)(T−3)(1+ε(εδ2−2εδ−2+ε)(1−δ)2).

For the variance, we have

Dδ{BT}=14T(1−(1−δ)2ε2(1−6δ+3δ2)−14(1−(1−δ)2ε2(1−10δ+5δ2))==T(14(1−(1−δ)2ε2(1−6δ+3δ2)))(1+o(1)), T→∞.

By the construction (4) the random sequence {Y_t} satishes the strong mixing property [10, 11], and hence the central limit theorem for weakly dependent random variables holds,

(10)Lδ{BT−1−12T(1−(1−δ)2ε)Dδ{BT}}→N(0, 1)

In the case ε > 0, we take into account (10) and substitute δ = ρ/T^β as T → ∞ into the expression for the power. As a result, we have

limW1B+=limPδ{X1B+}=limPδ{BT≥1+12T(1−ε)−12tαT(1−ε2)}==limPδBT−Eδ{BT}Dδ{BT}≥1+12T(1−ε)−Eδ{BT}−12tαT(1−ε2)Dδ{BT}==ΦlimTεδ(2−δ)+tαT(1−ε2T(1−(1−δ)2ε2(1−6δ+3δ2))

Analyzing various values of β in this expression, we arrive at (8). The case ε < 0 is dealt with similarly. □

3.2 Using the short runs statistics

Let us construct the sequence of indicators of sign changes in the sequence Y₁, ..., Y_T ∈ V_T:

(11)zt=Yt⊕Yt+1∈V, t=1, ..., T−1.

Next, we define the set of patterns in sequence (11):

{𝔟1, 𝔟2, ...}, 𝔟τ=(1, 0, ..., 0, ︸τ1), τ∈ℕ∪{0};

here 𝔟_τ is the chain of τ successive 0’s bounded from the left and right by 1’s. Such patterns specify series of 0’s and 1’s of length τ + 1 in the stego-sequence {Y_t}. Further, we consider the disjoint random events ℭ_τ, τ ∈ ℕ ∪ {0}:

ℭτ={(zt, zt+1, ..., zt+τ+1)=𝔟τ}.

Lemma 2. Let the model of embedding (4) hold, q = r = 1. Then under the alternativeH₁ the probability distribution of the random events ℭ_τ is given by

(12)Pδ{Cτ}=P0{Cτ}+aτ(δ, ε)=2−(τ+2)(1+ε)τ(1+ε)2+aτ(δ, ε),

where 𝔞_τ(δ, ε) → 0 as δ → 0, |ε| < 1.

Proof. Using the law of total probability for the model of embedding under consideration we hnd that

Pδ{Cτ}=∑u1τ+2∈Vτ+2Pδ{γtt+τ+1=u1τ+2}Pδ{(zt, zt+1, ..., zt+τ+1)=bτγtt+τ+1=u1τ+2}==(1−δ)τ+2P0{Cτ}−δ∑u1τ+2∈Vτ+2:ω(u1τ+2)>0δω(u1τ+2)−1(1−δ)τ+2−ω(u1τ+2)××Pδ{zt, zt+1, ..., zt+τ+1)=bτγtt+τ+1=u1τ+2}δ→0→2−(τ+2)(1+ε)τ(1−ε)2.

Theorem 2. Under the hypotheses of Lemma 2, the function 𝔞_τ(δ, ε) has the asymptotic expansion

(13)aτ=δaτ(1)(ε)+O(δ)2, aτ(1)(ε)=2−1ε(2−ε), τ=0, 2−2ε(1−ε)(1+ε), τ=1, 2−τ−1ε(1−ε)(1+ε)τ−2(ε2+(τ+1)ε−τ+2), τ≥2.

Proof. We partition the set 𝔘_{τ+2, 1} = {u^τ+2₁ = (u₁, ..., u_τ+2) ∈ V_τ+2: w(u^τ+2₁) = 1}, |𝔘| = τ + 2, of binary vectors of length τ + 2, τ ≤ 3, with unit Hamming weight into three disjoint subsets:

Uτ+2, 1=Uτ+2, 1(0)∪Uτ+2, 1(1)∪Uτ+2, 1(2), Uτ+2, 1(j)={u1τ+2∈Uτ+2, 1:uj+1+uτ+2−j=1}, j∈{0, 1}, Uτ+2, 1(2)={u1τ+2∈Uτ+2, 1:∑j=3τuj=1}.

Arguing as in the proof of Lemma 2, we have

(14)Pδ{Cτ}=P0{Cτ}−δ(τ+2)P0{Cτ}++δ∑j∈{0, 1, 2}∑u1τ+2∈Uτ+2, 1(j)Pδ{(zt, zt+1, ...zt+τ+1)=bτγtt+τ+1=u1τ+2}+O(δ2).

Let us consider the case τ ≥ 2. The subset 𝔘^(j)_{τ+2, 1}, j ∈ {0, 1, 2}, contains sequences u^τ+2₁ ∈ 𝔘_{τ+2, 1} such that the events ℭ_τ ∩ {γ^t+τ+l_t = u^τ+2₁} are equiprobable under the alternative H_l:

(15)Pδ{Cτ∩{γtt+τ+1=u1τ+2}}==δ(1−δ)τ+22−τ−3(1−ε)(1+ε)τ, u1τ+2∈Uτ+2, 1(0), δ(1−δ)τ+22−τ−3(1−ε)2(1+ε)τ, u1τ+2∈Uτ+2, 1(1), δ(1−δ)τ+22−τ−3(1−ε)2(1+ε2)(1+ε)τ−2, u1τ+2∈Uτ+2, 1(2)

Now (13) with τ ≥ 2 follows by substitution of (15) into (14). The case τ < 2 in (13) is considered similarly. □

Theorem 3. Under the hypotheses of Lemma 2, the function 𝔞_τ(δ, ε) has the second-order asymptotic expansion

aτ=δaτ(1)(ε)+δ2aτ(2)(ε)+O(δ3),

where

a0(2)(ε)=2−2ε(−2+ε), a1(2)(ε)=2−3ε(−1+4ε+ε2), a2(2)(ε)=2−4ε2(−7+10ε+ε2), a3(2)(ε)=2−5ε(1−12ε+2ε2+16ε3+ε4), aτ(2)(ε)=2−τ−2ε(1+ε)τ−4(τ−2+ε(2τ2−14τ+13)+2ε2(−2τ2+7τ−8)++2ε3(τ2−3τ+9)+ε4(5τ+2)+ε5), τ≥4.

The proof is similar to that of Theorem 2, the set of stego-keys 𝔘_{τ+2, 2} being split into classes of equiprobable events.

From Theorems 2, 3 it follows that under the alternative H_l (existence of embeddings) the probability distribution of the total number of runs of a given length differs from that distribution under the hypothesis H₀. In particular, for ε > 0 the probabilities of the events ℭ₀, ℭ₁, ℭ₂ increase as δ increases from 0 to 1, whereas for τ > τ_ε = 2 + ε(3 + ε)(1 - ε)^-1 the probability P_δ{ℭτ} decreases with the increasing of δ. This being so, we consider the statistics

(17)BT, 1=∑t=1T−2zt, BT, 2=∑t=1T−2ztzt+1,

where the statistics 𝓑_{T, 2} is the total number of series of 0’s and of 1’s of length 1 in the sequence {y_t}, and the statistics 𝓑_{T, 1} is related to the total number of runs statistics B_T by the relation 𝓑_{T, 1} = B_T - z_T-l - 1.

Using Theorem 1 from [5] one may show that under the alternative H_l the initial ftrst-order moments of the bivariate statistics (𝓑_{T, 1}, 𝓑_{T, 2}), as given by (17), read as

(18)Eδ{BT, 1}=(T−2)12(1−(1−δ)2ε)=E0{BT, 1}+T12δ(2−δ)ε+o(T), T→∞, Eδ{BT, 2}=(T−2)14(1−(1−δ)2ε(2−ε))=E0{BT, 2}+T14δ(2−δ)ε(2−ε)+o(T), T→∞.

From (18) it is seen that for ε > 0 the mean number of sign changes or of two neighbouring sign changes is larger when the embeddings exist than in the opposite case.

Theorem 4. Let the model of embedding (4) holds. Then, asT → ∞, the statistical test for the hypothesesH_{0, ε}, H₁of the asymptotic significance level α ∈ (0, 1) based on the bivariate statistics (17) is given by the critical region

(19)X1αB1, 2={y1T:(BT, 1, BT, 2)∈D1, 2},

where the region 𝓓_{1, 2}is as follows

(20)D1, 2=(BT, 1, BT, 2):(BT, 1−Tμ0, 1)ε≥0, (BT, 2−Tμ20, 1)ε≥0, BT, 1−μ0, 1BT, 2−μ0, 12′(5−3ε)(1−ε)16−1−ε4−1−ε414BT, 1−μ0, 1BT, 2−μ0, 12≥TC1, 2, μ0, 1=12(1−ε), C1, 2=2−5(1−ε2)2ln⁡π−arccos21−ε5−3ε2πα,

that is,

P0{X1αB1, 2=P0{(BT, 1, BT, 2)∈D1, 2}→α}.

Proof. Using the fact that under the hypothesis H_{0, ε} the random variables {z_t} are independent and have the Bernoulli distribution 𝔅(2^-1(1 - ε)), and since the random variables z_t z_t+1 and z_sz_s+1 are independent if |t - s| > 1, we hnd that

E0{BT, 1}=T12(1−ε)(1+o(1)), E0BT, 2=T141−ε21+o1, D0{BT, 1}=(T−2)D0{Zt}=T14(1−ε2)(1+o(1)), D0{BT, 2}=(T−2)D0{ztzt+1}+2∑1≤t<s≤T−2cov0{ztzt+1, zszs+1}==(T−2)D0{ztzt+1}+2(T−3)cov0{ztzt+1, zt+1zt+2}==(T−2)(14(1−ε)2−116(1−ε)4)+2(T−3)(18(1−ε)3)−116(1−ε)4)==T116(1−ε2)(1−ε)(5−3ε)(1+o(1)), cov0{BT, 1, BT, 2}=∑t, s=1T−2cov0{zt, zszs+1}==(T−2)cov0{zt, ztzt+1}+(T−3)cov0{zt+1, ztzt+1}==(2T−6)(14(1−ε)2−18(1−ε)3)=T14(1−ε2)(1−ε)−34(1−ε2)(1−ε)==T14(1−ε2)(1−ε)(1+o(1)).

Next, since the sequence of pairs (Z_t, Z_tZ_t+1) ∈ V₂ is 1-dependent, it follows that as T → ∞ the random vector 1T(BT, 1−12T(1−ε), BT, 2−14T(1−ε)2)′ has an asymptotic normal distribution 𝓝₂((0, 0))′, Σ₀), where

Σ0=(1−ε2)141−ε41−ε4(5−3ε)(1−ε)16.

In Fig. 1 the region 𝓓_{1, 2} for the case ε > 0 is marked by the ‘+’ sign. Such a form of the domain follows from the asymptotical normality of the bivariate statistics (𝓑_{T, 1}, 𝓑_{T, 2}) and from expressions (18). To calculate the probability of the hrst kind error, we use the linear transform of the region 𝓓_{1, 2}. We apply the matrix Σ^{- 1/2}₀ to the unit vectors (1, 0), (0, 1) ∈ 𝓡² and construct the Gram matrix:

u1=Σ0−12(1, 0)′, u2=Σ0−12(0, 1)′, u1′u1u1′u2u1′u2u2′u2=Σ0−1=26(1−ε2)2(5−3ε)(1−ε)16−1−ε4−1−ε414.

Figure 1

The region 𝓓_{1, 2} for ε > 0 and the scattering ellipses for 5 e [0, 1].

The angle φ between the vectors u_l and u₂ is expressed in terms of the coefficient of correlation:

ϕ=arccosu1′u2u1u2=π−arccos⁡(corr0{BT, 1, BT, 2})=π−arccos21−ε5−3ε.

Because of the joint asymptotic normality of statistics (17), the random variable

Q1, 2=1TBT, 1u0, 1BT, 2u0, 12′Σ0−1BT, 1u0, 1BT, 2u0, 12

has an asymptotically exponential distribution with the parameter 1/2 as T → ∞. Hence, from the equation

P0{(BT, 1, BT, 2)∈D1, 2}=P0Q1, 2≥cϕ2π=(π−arccos⁡(21−ε5−3ε))2πec/2=α

we find c=26(1−ε2)2c1, 2 (the ellipse equation in Fig. 1: Q12 = c). The case ε < 0 is considered similarly with u1=Σ0−12(−1, 0)′, u2=Σ0−12(0, −1)′.

Lemma 3. Under the (1, 1)-model of embedding and the alternativeH_lthe random variablesz_t, z_sare independent if |t - s| ≥ 2, the random variablesz_t, z_sz_s+lare independent if |t - s| ≥ 2, and the random variables z_tz_{t + 1}, z_sz_{s + 1} are independent if |t - s| ≥ 3.

Proof. Let us consider the random variables z_t, z_t+k, k ≥ 2, and find the expectation E_δ{z_tz_t+k}, k ≥ 2:

Eδ{ztzt+k}=Pδ{ztzt+k=1}=2Pδ{Yt=0, Yt+1=1, Yt+k=0, Yt+k+1=1}++2Pδ{Yt=0, Yt+1=1, Yt+k=1, Yt+k+1=0}==2∑u∈v4Pδ{(Yt, Yt+1, Yt+k, Yt+k+1)=(0, 1, 0, 1), (γt, γt+1, γt+k, γt+k+1)=u}++2∑u∈v4Pδ{(Yt, Yt+1, Yt+k, Yt+k+1)=(0, 1, 0, 1), (γt, γt+1, γt+k, γt+k+1)=u}==26∑c∈{1, −1}((1−δ)4(1−ε)2(1−cεk−1)+δ(1−δ)3(2(1−ε)(1−cεk−1)+2(1−ε)(1+cεk))++δ2(1−δ)2(6−2ε−cεk−1+2cεk−cεk+1)+4δ3(1−δ)+δ4)=12(1−ε(1−δ)2)2=(Eδ{zt})2.

Since the random variables z_t, z_t+k are binary and since cov, {z_t, z_t+k} = 0 for _k ≥ 2, then such variables are independent. A similar argument shows that the random variables z_t, z_sz_s+l are independent if |t - s| ≥ 2 and that the random variables z_tz_t+l, z_sz_s+l are independent if |t - s| ≥ 3. □

Now we will employ Lemma 3 to hnd asymptotic expressions for the hrst and second moments of the bivariate statistics (𝓑_{T, 1}, 𝓑_{T, 2}) under the alternative H_{1, δ}. The hrst-order moments were found in (18). In the course of the proof of Theorem 1 it was shown that

Dδ{BT, 1}=T14(1−(1−δ)2ε2(1−6δ+3δ2))(1+o(1)), T→∞.

In view of Lemma 3 we have, as T → ∞,

covδ{BT, 1, BT, 2}=∑t, s=1T−2covδ{zt, zszs+1}==(T−2)covδ{zt, ztzt+1}+(T−3)covδ{zt, zt+1zt+2}++(T−3)covδ{zt, zt−1zt}+(T−4)covδ{zt, zt−2zt−1}==2T(covδ{zt, ztzt+1}+covδ{zt, zt+1zt+2}(1+o(1)), covδ{zt, ztzt+1}=Pδ{ztzt+1=1}(1−Pδ{zt=1})==14(1−(1−δ)2ε(2−ε))(1−12(1−(1−δ)2ε))==18(1−(1−δ)2ε(1−ε)−(1−δ)4ε2(2−ε)), covδ{zt, zt+1zt+2}=Pδ{zt, zt+1zt+2=1}−Pδ{zt=1}Pδ{zt+1zt+2=1}.

Using the law of total probability, we hnd, for the model of embedding (1, 1),

Pδ{ztzt+1zt+2=1}=2Pδ{Yt=0, Yt+1=1, Yt+2=0, Yt+3=1}==2∑u∈V4Pδ{Yt=0, Yt+1=1, Yt+2=0, Yt+3=1, γtt+3=u}==18(1−(1−δ)2ε(3−2ε+ε3)+(1−δ)4ε2), covδ{zt, zt+1zt+2}=pδ{ztzt+1zt+2=1}−Pδ{zt=1}Pδ{zt+1zt+2=1}==Pδ{ztzt+1zt+2=1}−18(1−(1−δ)2ε(3−ε)+(1−δ)4ε2(2−ε))==18(1−δ)2δ(2−δ)ε2(1−ε).

We thus have

covδ{BT, 1, BT, 2}=T14(1−(1−δ)2ε(1−ε)2−(1−δ)4ε2(3−2ε))(1+o(1)).

Using Lemma 3 as T → ∞ we hnd the variance DÒ{Bf2}:

Dδ{BT, 2}=(T−2)Dδ{ztzt+1}+2(T−3)covδ{ztzt+1, zt+1zt+2}++2(T−4)covδ{ztzt+1, zt+2zt+3}, Dδ{ztzt+1}=Pδ{ztzt+1=1}(1−Pδ{ztzt+1=1}==116(1−(1−δ)2ε(2−ε))(3+(1−δ)2ε(2−ε)), covδ{ztzt+1, zt+1zt+2}=Pδ{ztzt+1zt+2=1}−(Pδ{ztzt+1})2==116(1−(1−δ)22ε(1−ε+ε2)−(1−δ)4ε2(2−4ε+ε2)), covδ{ztzt+1, zt+2zt+3}=Pδ{ztzt+1zt+2zt+3=1}−(Pδ{ztzt+1=1})2.

A similar argument as for P_δ {z_tz_t+1z_t+2 = 1} shows that

Pδ{ztzt+1, zt+2zt+3=1}=2Pδ{Yt=0, Yt+1=1, Yt+2=0, Yt+3=1, Yt+4=0}==2∑u∈V5Pδ{Yt=0, Yt+1=1, Yt+2=0, Yt+3=1, Yt+4=0, γtt+4=u}==116(1−(1−δ)2(ε(4+δ3)−3ε2(2−2δ+δ2)+ε3(4−4δ+2δ2−δ3)−ε4)), covδ{ztzt+1, zt+2zt+3}=116(1−δ)2δε(1−ε)(−δ2+ε(2−δ−δ2)−ε2(2−δ)).

As a result, we have

DδBT, 2=T116(5−(1−δ)2(2(4+δ3)ε+2(1−10δ+5δ2)ε2−−2(4−16δ+8δ2+δ3)ε3+(3−10δ+δ2)ε4))(1+o(1)).

Using the strong mixing property [10], one may show that under the alternative H_{1, δ} the distribution of the random vector

1T(BT, 1−12T(1−(1−δ)2ε), BT, 2−14T(1−1−δ)2ε(2−ε)))′

as T → ∞ is asymptotically normal 𝓝₂((0, 0)′, Σ₁) with zero mean and covariance matrix Σ₁ = (σ_{1, ij}), i, j = 1, 2, where

σ1, 0014(1−(1−δ)2ε2(1−6δ+3δ2)), σ1, 01=σ1, 10=14(1−(1−δ)2ε(1−ε)2−(1−δ)4ε2(3−2ε)), σ1, 11=116(5−(1−δ)2(2(4+δ3)ε+2(1−10δ+5δ2)ε2−−2(4−16δ+8δ2+δ3)ε3+(3−10δ+δ2)ε4)).

Unfortunately, for the test (19) based on the short runs statistics we have not succeed to obtain an explicit expression for the power and to examine it, because the covariance matrix depends on δ. This dependence is illustrated in Fig. 1, which depicts the scattering ellipses (corresponding to the asymptotic matrices) when the parameter δ is increasing from 0 to 1. The following important property of the asymptotically normal distribution of the random vector (17) under the alternative H_{1, δ} is worth pointing out: with 5 changing from 0 to 1 the centre of the asymptotically normal distribution of the bivariate statistics (𝓑_{T, i}, 𝓑_{T, 2}) always lies on the line

(22)b1=12TεΔ+12T(1−ε), b2=14Tε(2−ε)Δ+14T(1−ε)2, Δ=δ(2−δ).

Taking into account the property (22), we construct a statistical test for the hypotheses H_{0, ε}, H₁ based on the statistics obtained as the orthogonal projection of the statistics (𝓑_{T, i}, 𝓑_{T, 2}) on the line (22). Such a test for ε > 0 is given by the critical region

(23)X1αh+={y1T:BT, 1+12(2−ε)BT, 2≥12T(1−ε)2(2−ε)−tαTdh}, dh=2−6(1−ε2)(68−100ε+65ε2−20ε3+3ε4).

Theorem 5. Let the model of embedding (4) hold and let ε > 0. Then, asT → ∞, the asymptotic size of test (23) for the hypothesesH_{0, ε}, H₁based on the projection of the short runs statistics

(24)h=BT, 1−12T(1−ε)+12(2−ε)(BT, 2−14T(1−ε)2)

coincides with the significance level α ∈ (0, 1). The asymptotic power of this test for the (1, 1)-model of embedding and for the family of contiguous alternativesH1, δ:{δ=ρT}is as follows:

(25)W1h+→Φtα+ρε(1+14(2−ε)2)dh, T→∞.

Proof. The angle between the line (22) and the b₂-axis is φ = arctan (1/2(2 - ε)), and hence, the orthogonal projection of the point (𝓑_{T, i}, 𝓑_{T, 2}) on this line is given by

(BT, 1−12T(1−ε))cos⁡ϕ+(BT, 1−12T(1−ε))sin⁡ϕ.

Multiplying this expression by cosec ϕ, we get the random variable 𝔥, which, according to (21), has the asymptotically normal distribution 𝓝₁(0, d_#x1D525;) under the hypothesis H_{0, ε}. Hence, P₀{X^{𝔥 +}_{1 α}} → α as T → ∞.

Let us hnd the power of test (23) as T → ∞ for contiguous alternatives of the form indicated in the theorem. We have

W1h+=Pδ{BT, 1+12(2−ε)BT, 2≥12T(1−ε)+18T(1−ε)2(2−ε)−tαTdh}=Pδ{BT, 1+12(2−ε)BT, 2−Eδ{BT, 1}−12(2−ε)Eδ{BT, 2}≤≤12Tδ(2−δ)ε+18Tδ(2−δ)ε(2−ε)2+tαTdh}→→Φlim12Tδ(2−δ)ε+18Tδ(2−δ)ε(2−ε)2+tαTdh}T(σ1, 00+14(2−ε)2σ1, 11+(2−ε)σ1, 01).

Substituting δ=ρT in this expression on as T → ∞, we find that

W1h+→ΦlimTρε+14Tρε(2−ε)2+tαTdh+O(1)T(dh+O(1T))→Φtα+ρε(1+14(2−ε)2)dh.

4 Embedding detection on the basis of the likelihood ratio statistics

Let us now consider the case when the parameter ε in (1) is unknown and separated from the zero: ε₀ ≤ |ε| < 1, where ε₀ > 0 is the known boundary value.

We construct the likelihood function for the observed stego-sequence y^T₁ ∈ V_T. Following [5], we partition the set V_t of binary t-dimensional vectors into t + 1 disjoint subsets,

(26)Vt=Γ0(t)∪Γ1(t)∪....∪Γt(t),

where

(27)Γ0(t)={u1t∈Vt:ut=1}, Γ1(t)={u1t∈Vt:ut−1=ut=0}, Γj(t)={u1t∈Vt:ut−j=0, ut−j+1=...=ut−1=1, ut=0}, 1<j<t, Γt(t)={u1t∈Vt:u1=...ut−1=ut=1}.

The partition (26), (27) generates the partition of all possible trajectories of fragments of the key sequence γ^t₁ = u^t₁ ∈ V_t.

Lemma 4. The likelihood function for the (q, r)-block model of embedding is as follows

L(ε, δ)=Pδ{Y1t=y1T}=2−T∑u1T∈Γ(q, r)(1−δ)b0(u1T)(δ/Cqr)br(u1T)∏t=1Tφt(u1t, y1t),

where

φt(u1t, y1t)=1, u1t∈Γ0(t), 1+(−1)yt−j+ytεj, u1t∈Γj(t), 1≤j<t, 1, u1t∈Γt(t).

The proof is similar to that of Theorem 5 for the q-block model of embedding in [5].

To test the hypotheses H₀, H₁ on the existence of embeddings we now construct the statistical likelihood ratio test [12]. The statistics λ_T of this test for the hypotheses H₀, H₁ takes the form

(28)λT=λT(y1T)=−2ln⁡L(ε^, 0)max{L(ε^1, δ^1), L(ε^, 0)}≥0,

where ε^, (ε^1, δ^1) are the maximum-likelihood estimates, which were constructed in [5] under the hypotheses H₀ and H₁ respectively. The statistics (28) introduced above is equivalent to the likelihood ratio statistics

sup|ε|<1, δ>0Pδ{y1, ..., yT}sup|ε|<1P0{y1, ..., yT}.

Besides, according to [5],

arg⁡max|ε|<1, δ>0Pδ{y1, ..., yT}=(ε^1, δ^1), arg⁡max|ε|<1P0{y1, ..., yT}=ε^.

The statistical test of size α ∈ (0, 1) based on the statistics λ_T is defined by the critical region

(29)X1αλ={y1T∈VT:λT≥λα},

where λ_α > 0 is the solution of the equation

(30)supε0≤|ε|<1P0{λT≥λ}=supε0≤|ε|<1(1−F0(ε, T, λT))=α.

Here, F₀(ε, T, λ_T) is the distribution function of the statistics (28) under the null hypothesis H₀.

To estimate the value of λ_α satisfying (30), we use the Monte Carlo method: we model M₀ samples of a Markov chain of length T with the parameter ε₀. For each sample we calculate the value of the statistics by (28). Let λ⁽¹⁾, ..., λ^(M₀) be the calculated values. Then λ_α can be estimated by the sample quantile of level 1 - α:

(31)λ^α=λ([(1−α)M0]);

the accuracy of this estimate increases with M₀ → ∞. So, the statistical tests (29) for the embedding existence assumes the formml:

the hypothesis H₀ (respectively, H₁) is adopted if p ≥ α (p < α),

p=1M0+1(1+∑i=1M0I{λ(i)>λT}).

The available asymptotic properties of the likelihood ratio test [12, 13] may be used under the regularity conditions [12] guaranteeing the existence, uniqueness, and asymptotic normality of the maximum likelihood estimates of the parameters ε and δ.

Theorem 6. Under the model of embedding (4), asT → ∞ the test of asymptotic significance level α ∈ (0, 1) based on the likelihood ratio statistics for the composite null hypothesis is given by the critical region (29) with the threshold λ_α = χ²_{1-α, 1}; that is,

P0X1αλ=P0λT≥χ1−α, 12→α.

This test is consistent under fixed alternatives δ = δ₁ > 0:

W1λ=Pδ{X1αλ}→1.

The proof follows the argument of [13] with the use of the central limit theorem for weakly dependent random variables [10].

5 5 Statistical estimation of embeddings points

If the alternative H₁ is adopted, then there arises the problem of estimation of points of embeddings—these being the time instants t ∈ {1, ..., T} at which in accordance with (4) a bit of the sequence {x_t} is replaced by a bit of the hidden message {𝔏_t}.

Theorem 7. Letγ^T₁ = (γ₁, ..., γ_T) ∈ Γ^{(q, r)}be the key sequence corresponding to the(q, r)-model of embedding, y^T₁ ∈ V_T be the observed stego-sequence, γ^1T=f(y1T)is some statistical estimate of the key sequence γ^T₁basedon observationsy^T₁. The minimum of the error probability in estimating the stego-key

Pδ{γ^1T≠γ1T}→min

is attained for the statistics

(32)γ^1T∗=arg⁡maxu1T∈Γ(q, r)Pδ{γ1T=u1T|Y1T=y1T},

which maximizes the a posteriori probability of the stego-key. The minimum of error probability is as follows:

(33)r∗(ε, δ, T)=minf(.)Pδγ^1T≠γ1T==1−∑y1T∈VTPδY1T=y1Tmaxu1T∈Γ(q, r)Pδγ1T=u1TY1T=y1T.

Proof. We choose an arbitrary statistics

(34)γ^1T=f(Y1T):VT→Γ(q, r),

and calculate for it the corresponding error probability for the estimate of the true stego-key error probability for the estimate of the true stego-key γ^T₁ ∈ Γ^{(q, r)}:

r(f;ε, δ, T)=Pδγ^1T≠γ1T=1−Pδγ^1T=γ1T.

After equivalent transformations, using (34) and the rlaw of total probability, we hnd that

(35)r(f;ε, δ, T)=1−∑u1T∈Γ(q, r)Pδγ^1T=γ1T, γ1T=u1T=1−∑u1T∈Γ(q, r)∑y1T∈VTPδfY1T=γ1T, γ1T=u1T, Y1T=y1T==1−∑y1T∈VT∑u1T∈Γ(q, r)PδY1T=y1TPδγ1T=u1TY1T=y1T×Pδf(Y1T)=γ1Tγ1T=u1T, Y1T=y1T==1−∑y1T∈VTPδY1T=y1T∑u1T∈Γ(q, r)If(y1T)=u1TPδγ1T=u1TY1T=y1T.

Minimizing this expression in f(•) and using (34), we obtain the optimal function f(•) in the form

(36)f∗y1T=arg⁡maxu1T∈Γ(q, r)Pδγ1T=u1TY1T=y1T,

which agrees with the statistics (32).

Substituting (36) into (35), we get (33). □

The estimate (32) by the maximum a posteriori probability criterion admits the following equivalent representation, which is convenient for its evaluation:

(37)γ^1T∗=arg⁡maxu1T∈Γ(q, r)Pδγ1T=u1TY1T=y1T=arg⁡maxu1T∈Γ(q, r)Pδγ1T=u1T, Y1T=y1T.

The solution of problem (37) for the (q, r)-block model of embedding by the exhaustive search has a computational complexity O(T(1+Cqr)T/q).. Let us hnd a polynomial algorithm for solving this problem on the basis of the classical Viterbi algorithm [14].

We set

stut−c, ..., ut=maxu1, ..., ut−c−1∈VlogPδY1t=y1t, γ1=u1, ..., γt=ut, c=max{2r+1, q−1}.

The initial values of 𝔰_t(u₁, ..., u_t) with t = 1, ..., c are as follows:

(38)s1(u1)=logφ1(u1, y1)+logPδγ1=u1, st(u1, ..., ut)=st−1(u1, ..., ut−1)+logφt(u1t, y1t)++logPδγt=utγt−1=ut−1, ..., γ1=u1, 2≤t≤c;

here, φ_t(˙) is the same as in Lemma 4.

Theorem 8. Under the(q, r)-block model of embedding (4), q > r, the recurrence relation

(39)st(ut−c, ..., ut)==maxut−c−1∈Vst−1(ut−c−1, ut−c, ..., ut−1)+logft(ut−2r−1t, yt−2r−1t)++logPδγt=utγt−1=ut−1, ..., γt−c=ut−c

holds for 𝔰_t(u_{t - c}, ..., u_t) with t > c, where

ft(ut−2r−1t, yt−2r−1t)=12, u1t∈Γ0(t), 121+(−1)yt−j+ytεj, u1t∈Γj(t), 1≤j≤2r+1

Proof. In the case q ≤ 2r + 2 we have

stut−2r−1, ..., ut=maxu1, ..., ut−2r−2∈VlogPδY1t=y1t, γ1=u1, ..., γt=ut==maxu1, ..., ut−2r−2∈VlogPδY1t−1=y1t−1, Yt=yt, γ1=u1, ..., γt−1=ut=1, γt=ut==maxu1, ..., ut−2r−2∈VlogPδY1t−1=y1t−1, γ1=u1, ..., γt−1=ut=1++logPδγt=utγ1=ut|γ1=u1, ..., γt−1=ut−1+++logPδ{Yt=yt|Y1t−1=y1t−1, γ1=u1, ..., γt=ut}.

The case q > 2r + 2 is dealt with similarly. Combining these cases, we arrive at (39). □

Corollary 1. Under the hypotheses of Theorem 8 the estimateγ^1T=(γ^1, ..., γ^T) of the stego-key by the maximum a posteriori probability criterion is as follows

(40)γ^T−C, ..., γ^T=arg⁡maxuT−c, ..., uT∈VsT(uT−c, ..., uT), γ^t=arg⁡maxv∈Vst+c(v, γ^t+1, ..., γ^t+c), t=T−c−1, ..., 1.

Proof. The estimate γ^1T=(γ^1, ..., γ^T) of the stego-key is obtained as the reverse execution of the algorithm for finding maxuT−c, ..., uT∈VsT by (38), (39). □

The algorithm of the estimation of embedding points (the forward run (38), (39), the backward run (40)) has a numerical complexity O(2c+(T−c)22c+2).

Having estimate the embedding points γ^T₁ by (40), one can construct an estimate ξ^ of the message itself:

ξ^τ=ytτ, wheretτ=mint∈{1, ..., T}{t:∑k=1tγ^k=τ}, τ=1, ..., ω(γ^1T).

6 Results of computer experiments

We give the results of three series of computer experiments using simulated data.

Series 1. The initial Markov sequence (1) of length T = 10⁴ with the parameter ε = 0.13 was simulated. For q = r = 1, the key Bernoulli sequence was simulated using (3) with various values of the parameter δ ∈ [0, 1], the stego-sequence y^T₁ was constructed by (4). Figure 2 depicts the total number of runs statistics B_T versus the fraction of embeddings δ. Circles mark the values of the statistics 1T−1BT for the sequence y^T₁ thus constructed with the corresponding fraction of embeddings δ, the solid line shows the graph for the mean value 1T−1Eδ{BT}..

$Figure 2 The total number of runs statistics BT versus the fraction of embeddings δ$

Figure 2

The total number of runs statistics B_T versus the fraction of embeddings δ

Series 2. As in Series 1, the Monte Carlo method with the number of replications M₁ = 28 was used to construct estimates of powers for the tests (7), (23) under the hypotheses H_{0, ε}, H₁ with known cover sequence parameter ε = 0.48; the length T = 2¹³, the signihcance level α = 0.05, and the fraction of embeddings δ ∈ {0.005, 0.01, 0.015, 0.02, 0.025, 0.03, 0.04, 0.05, 0.06, 0.07}.

$Figure 3 Powers of the tests XB+1α, X𝔥 +1α versus the fraction of embeddings δ.$

Figure 3

Powers of the tests X^B+_1α, X^{𝔥 +}_1α versus the fraction of embeddings δ.

Figure 3 shows graphically the powers of the statistical tests (7), (23) versus the fraction of the embeddings δ. The black solid line depicts the theoretical curve of the test power (7) based on the total number of runs statistics, the grey solid line shows the theoretical curve of the test power (23) based on the projection of short runs statistics, the black circles correspond to estimates of the powers of test (7), the white circles show estimates of the powers of test (23). The 95%-conhdence intervals for the powers of tests (7) and (23) are shown in grey and black, respectively.

It is seen from the graph that test (23) based on the short runs statistics is more powerful than test (7) based on the total number of runs statistics. Numerical experiments show that for small values ε the powers of tests (7) and (23) are practically the same.

$Figure 4 Power of the test 𝔥λ1α versus the fraction of embeddings δr/q..$

Figure 4

Power of the test 𝔥^λ_1α versus the fraction of embeddings δr/q..

Series 3. For the block model of embedding with q = 2, r = 1, the Monte Carlo method was used to hnd the threshold estimates λ^α by (31) and the power of the statistical test (29) based of the likelihood ratio with the model parameters ε = 0.12; the length T = 2¹⁸, and the signihcance level α = 0.05. The threshold estimate was calculated with the number of replications M₀ = 500, the estimates of powers were calculated with the number of replications M_λ = 250, 100, 200, 150, 100 and the fraction of the actual embedding δr/q = δ/2, which equals 0.10, 0.15, 0.20, 0.25, 0.30, respectively. Figure 4 shows the graph of the power estimates for the test X^λ_1α versus the fraction of the actual embedding δ/2.

Computer experiments demonstrate the efficiency of the statistical test thus constructed for the embedding detection and the agreement between theoretical and experimental results.

In conclusion, we note that embeddings may also be detected using small-parametric models of highorder Markov chains [15].

Note: Originally published in Diskretnaya Matematika (2015) 27, N^o3, 123–144 (in Russian).

Acknowledgment

The authors are grateful to A. M. Zubkov for suggesting to study the runs statistics for the embedding detection and to the referees for comments and advices.

References

1 Ponomarev K. I., “A parametric model of embedding and its statistical analysis”, Discrete Math. Appl, 19:6 (2009), 587-596.10.1515/DMA.2009.039Search in Google Scholar

2 Ponomarev K. I., “On one statistical model of steganography”, Discrete Math. Appl., 19:3 (2009), 329-336.10.1515/DMA.2009.021Search in Google Scholar

3 Ker A., “A capacity result for batch steganography”, IEEE Signal Process. Lett., 14:8 (2007), 525-528.10.1109/LSP.2006.891319Search in Google Scholar

4 Shoytov A.M., “On the fact of detecting the noise in hnite Markov chain with an unknown transition probability matrix”, Prikl. Diskr. Mat., 2010, Na Supplement Na 3, 44-45 (in Russian).Search in Google Scholar

5 Kharin Yu. S., Vecherko E. V., “Statistical estimation of parameters for binary Markov chain models with embeddings”, Discrete Math. Appl., 23:2 (2013), 153-169.10.1515/dma-2013-009Search in Google Scholar

6 Zubkov A. M., “Pseudorandom number generatgors and its applications”, Proc. II Int. Sci. Conf. “Mathematics and security of information technologies”, 2003, 200-206.Search in Google Scholar

7 Kemeny J. G., Snell J. L., Finite Markov chains, Van Nostrand, 1960.Search in Google Scholar

8 Ivanov V.A., “Models of inclusions into homogeneous random sequences”, Tr. Diskr. Mat., 10 (2008), 18-34 (in Russian).Search in Google Scholar

9 A statistical test suite for random and pseudorandom number generators for cryptographic applications: NIST Special Publication 800-22 Rev. 1a., Nat. Inst. Stand. Technol., 2010.Search in Google Scholar

10 Doukhan P., Mixing: properties and examples, Springer-Verlag, 1994.10.1007/978-1-4612-2642-0Search in Google Scholar

11 Kharin Yu. S., Voloshko V. A., “Robust estimation of AR coefficients under simultaneously influencing outliers and missing values”, J. Statist. Plan. Infer., 141:9 (2011), 3276-3288.10.1016/j.jspi.2011.04.015Search in Google Scholar

12 Ivchenko G. I., Medvedev Yu. I., Mathematical statistics, Vysshaya shkola, Moscow, 1984 (in Russian).Search in Google Scholar

13 Wald A., “Tests of statistical hypotheses concerning several parameters when the number of observations is large”, Trans. Amer. Math. Soc., 54:3 (1943), 426-482.10.1090/S0002-9947-1943-0012401-3Search in Google Scholar

14 Rabiner L. R., “A tutorial on hidden Markov models and selected applications in speech recognition”, Proc. IEEE, 77:2 (1989), 257-286.10.1016/B978-0-08-051584-7.50027-9Search in Google Scholar

15 Kharin Yu. S., Petlitskiy A. I., “A Markov chain of order s with r partial connections and statistical inference on its parameters”, Discrete Math. Appl., 17:3 (2007), 295-317.10.1515/dma.2007.026Search in Google Scholar

Received: 2015-3-31

Published Online: 2016-4-30

Published in Print: 2016-4-1

Articles in the same Issue

https://doi.org/10.1515/dma-2016-0002

Keywords for this article

steganography; model of embeddings; Markov chain; statistical test; power; total number of runs