36.2 Frequentist analysis 433We start with a straightforward decomposition:
E[Ti(n)] =E[n
∑t=1I{At=i}]
=E
[n
∑t=1I{At=i,Ei(t)}]
+E
[n
∑t=1I{At=i,Eic(t)}]
. (36.4)
In order to bound the first term letA′t= argmaxi 6 =1θi(t). ThenP(At= 1,Ei(t)|Ft− 1 )≥P(A′t=i,Ei(t),θ 1 (t)≥μ 1 −ε|Ft− 1 )
=P(θ 1 (t)≥μ 1 −ε|Ft− 1 )P(A′t=i,Ei(t)|Ft− 1 )≥G 1 T 1 (t−1)
1 −G 1 T 1 (t−1)P(At=i,Ei(t)|Ft− 1 ), (36.5)where in the first equality we used the fact thatθ 1 (t) is conditionally independent
ofA′tandEi(t) givenFt− 1. In the second inequality we used the definition of
G 1 sand the fact that
P(At=i,Ei(t)|Ft− 1 )≤(1−P(θ 1 (t)≥μ 1 −ε|Ft− 1 ))P(A′t=i,Ei(t)|Ft− 1 ),which is true since{At=i,Ei(t)occurs} ⊆ {A′t=i,Ei(t)occurs}∩{θ 1 (t)≤
μ 1 −ε}and the two intersected events are conditionally independent givenFt− 1.
Therefore using Eq. (36.5) we have
P(At=i,Ei(t)|Ft− 1 )≤(
1
G 1 T 1 (t−1)− 1
)
P(At= 1,Ei(t)|Ft− 1 )≤
(
1
G 1 T 1 (t−1)− 1
)
P(At= 1|Ft− 1 ).Substituting this into the first term in Eq. (36.4) leads toE
[n
∑t=1I{At=i,Ei(t) occurs}]
≤E
[n
∑t=1(
1
G 1 T 1 (t−1)−^1)
P(At= 1|Ft− 1 )]
=E
[n
∑t=1(
1
G 1 T 1 (t−1)−^1)
I{At= 1}]
≤E
[n− 1
∑s=0(
1
G 1 s− 1
)]
, (36.6)
where in the last step we used the fact thatT 1 (t−1) =sis only possible for one
round whereAt= 1. LetT ={t∈[n] : 1−FiTi(t−1)(μ 1 −ε)> 1 /n}. After some