20.1 Martingales and the method of mixtures 244Lemma20.3.Lethbe a probability measure onRd, thenM ̄t=∫
RdMt(x)dh(x)
is aF-adapted supermartingale withM ̄ 0 = 1.
The following theorem is the key result from which the confidence set will be
derived.Theorem20.4.For al lλ > 0 andδ∈(0,1),
P(
existst∈N:‖St‖^2 Vt(λ)− 1 ≥2 log(
1
δ)
+ log(
det(Vt(λ))
λd))
≤δ.The proof will be given momentarily. First though, the implications.Theorem20.5.Letδ∈(0,1). Then with probability at least 1 −δit holds that
for allt∈N,
‖θˆt−θ∗‖Vt(λ)<√
λ‖θ∗‖+√
2 log(
1
δ)
+ log(
detVt(λ)
λd)
.
Furthermore, if‖θ∗‖≤L, thenP(existst∈N+:θ∗∈C/ t)≤δwith
Ct={
θ∈Rd:‖θˆt− 1 −θ‖Vt− 1 (λ)≤√
λL+√
2 log(
1
δ)
+ log(
detVt− 1 (λ)
λd)}
.
Proof We only have to compare‖St‖Vt(λ)− 1 and‖θˆt−θ∗‖Vt(λ).‖θˆt−θ∗‖Vt(λ)=‖Vt(λ)−^1 St+ (Vt(λ)−^1 Vt−I)θ∗‖Vt(λ)
≤‖St‖Vt(λ)−^1 + (θ>∗(Vt(λ)−^1 Vt−I)Vt(λ)(Vt(λ)−^1 Vt−I)θ∗)^1 /^2
=‖St‖Vt(λ)− 1 +λ^1 /^2 (θ∗>(I−Vt(λ)−^1 Vt)θ∗)^1 /^2
≤‖St‖Vt(λ)− 1 +λ^1 /^2 ‖θ∗‖.And the result follows from Theorem 20.4.
Proof of Theorem 20.4 LetH=λI∈Rd×dandh=N(0,H−^1 ) andM ̄t=∫
RdMt(x)dh(x)=
1
√
(2π)ddet(H−^1 )∫
Rdexp(
〈x,St〉−1
2
‖x‖^2 Vt−1
2
‖x‖^2 H)
dx.Completing the square,〈x,St〉−1
2 ‖x‖2
Vt−1
2 ‖x‖2
H=1
2 ‖St‖2
(H+Vt)−^1 −1
2 ‖x−(H+Vt)− (^1) St‖ 2
H+Vt.
The first term‖St‖^2 (H+Vt)− 1 does not depend onxand can be moved outside the
integral, which leaves a quadratic ‘Gaussian’ term that may be integrated exactly
and results in
M ̄t=
(
det(H)
det(H+V)) 1 / 2
exp(
1
2
‖St‖^2 (H+Vt)− 1