Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 9 von 13
2018 IEEE International Symposium on Information Theory (ISIT), 2018, p.1006-1010
2018
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
Skyline Identification in Multi-Arm Bandits
Ist Teil von
  • 2018 IEEE International Symposium on Information Theory (ISIT), 2018, p.1006-1010
Ort / Verlag
IEEE
Erscheinungsjahr
2018
Quelle
IEEE Xplore
Beschreibungen/Notizen
  • We introduce a variant of the classical PAC multi-armed bandit problem. There is an ordered set of n arms A[1], ⋯, A[n], each with some stochastic reward drawn from some unknown bounded distribution. The goal is to identify the skyline of the set A, consisting of all arms A[i] such that A[i] has larger expected reward than all lower-numbered arms A[1], ⋯, A[i-1]. We define a natural notion of an ε-approximate skyline and prove matching upper and lower bounds for identifying an ε-skyline. Specifically, we show that in order to identify an ε -skyline from among arms with probability 1-δ, Θ([n/(ε 2 )]·min{log([1/(εδ)]), log([n/(δ)])}) samples suffice and are necessary in the -worst case. When ε ≫ 1/n, our results improve over the naïve algorithm, which draws enough samples to approximate the expected reward of every arm; the algorithm of (Auer et al., AISTATS'16) for Pareto-optimal arm identification is likewise superseded. Our results show that the sample complexity of the skyline problem lies strictly in between that of best arm identification (Even-Dar et al., COLT'02) and that of approximating the expected reward of every arm. [Full version available on arXiv: arxiv.org/abs/1711.04213].
Sprache
Englisch
Identifikatoren
eISSN: 2157-8117
DOI: 10.1109/ISIT.2018.8437618
Titel-ID: cdi_ieee_primary_8437618

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX