학술논문

Stochastic online optimization. Single-point and multi-point non-linear multi-armed bandits. Convex and strongly-convex case.
Document Type
Article
Source
Automation & Remote Control. Feb2017, Vol. 78 Issue 2, p224-234. 11p.
Subject
*STOCHASTIC analysis
*CONVEX functions
*SOUND measurement
*ALGEBRA
*AUTOMATION
Language
ISSN
0005-1179
Abstract
In this paper the gradient-free modification of the mirror descent method for convex stochastic online optimization problems is proposed. The crucial assumption in the problem setting is that function realizations are observed with minor noises. The aim of this paper is to derive the convergence rate of the proposed methods and to determine a noise level which does not significantly affect the convergence rate. [ABSTRACT FROM AUTHOR]