Michigan Tech Publications, Part 1

The time until the final zero crossing of random sums with application to nonparametric bandit theory

I. Pinelis, Michigan Technological UniversityFollow
S. Yakowitz, The University of Arizona

Document Type

Article

Publication Date

7-1994

Department

Department of Mathematical Sciences

Abstract

Motivated by problems in machine learning and more fundamentally, by non-Bayesian, nonparametric problems in the sequential design of experiments, this work contributes to the task of attaining probability bounds for the number of times suboptimal bandits are chosen in a nonterminating sequence of experiments. To our knowledge, previously only the growth of the expectation of the number of incorrect choices has been examined. The derivation is founded, in part, on new contributions to the theory of zero crossings for sums of biased independent, identically distributed (i.i.d.) random variables.

Publication Title

Applied Mathematics and Computation

Recommended Citation

Pinelis, I., & Yakowitz, S. (1994). The time until the final zero crossing of random sums with application to nonparametric bandit theory. Applied Mathematics and Computation, 63(2-3), 235-263. http://doi.org/10.1016/0096-3003(94)90197-X
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/5575

Link to Full Text

COinS

Michigan Tech Publications, Part 1

The time until the final zero crossing of random sums with application to nonparametric bandit theory

Document Type

Publication Date

Department

Abstract

Publication Title

Recommended Citation

LINKS

Browse

Search

Author Corner

Links

Michigan Tech Publications, Part 1

The time until the final zero crossing of random sums with application to nonparametric bandit theory

Authors

Document Type

Publication Date

Department

Abstract

Publication Title

Recommended Citation

Share

LINKS

Browse

Search

Author Corner

Links