The time until the final zero crossing of random sums with application to nonparametric bandit theory
Document Type
Article
Publication Date
7-1994
Department
Department of Mathematical Sciences
Abstract
Motivated by problems in machine learning and more fundamentally, by non-Bayesian, nonparametric problems in the sequential design of experiments, this work contributes to the task of attaining probability bounds for the number of times suboptimal bandits are chosen in a nonterminating sequence of experiments. To our knowledge, previously only the growth of the expectation of the number of incorrect choices has been examined. The derivation is founded, in part, on new contributions to the theory of zero crossings for sums of biased independent, identically distributed (i.i.d.) random variables.
Publication Title
Applied Mathematics and Computation
Recommended Citation
Pinelis, I.,
&
Yakowitz, S.
(1994).
The time until the final zero crossing of random sums with application to nonparametric bandit theory.
Applied Mathematics and Computation,
63(2-3), 235-263.
http://doi.org/10.1016/0096-3003(94)90197-X
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/5575