Non-asymptotic bounds for stochastic optimization with a biased noisy gradient oracle
We introduce an optimization oracle to capture a setting where the function measurements
have an estimation error that can be controlled through a batch size parameter. Our
proposed 'biased gradient oracle' is appealing in practical contexts, where either the
objective has to be estimated from Markovian samples, or the estimation scheme is 'biased'
due to computational constraints. In either case, increasing the batch size reduces the
estimation error. We highlight the applicability of our biased gradient oracle in a
risk-sensitive reinforcement learning setting. In the stochastic non-convex optimization
context, we analyze a variant of the randomized stochastic gradient (RSG) algorithm with
a biased gradient oracle. We quantify the convergence rate of this algorithm by deriving
non-asymptotic bounds on its performance. Next, in the stochastic convex optimization
setting, we derive non-asymptotic bounds that hold in expectation for the last iterate of a
stochastic gradient descent (SGD) algorithm.
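To make the oracle model above concrete, the following is a minimal sketch (not taken from the talk) of a plain gradient descent loop driven by a hypothetical biased gradient oracle. The stand-in objective f(x) = 0.5 * ||x||^2 and the assumption that both the bias and the noise level shrink like 1/sqrt(batch size) are illustrative choices only, not the precise setting analyzed by the speaker.

```python
import numpy as np

def biased_gradient_oracle(x, batch_size, rng):
    """Hypothetical oracle: returns grad f(x) + bias + noise, where both the
    bias and the noise standard deviation shrink like 1/sqrt(batch_size).
    f(x) = 0.5 * ||x||^2 is used purely as a stand-in objective."""
    true_grad = x                                    # gradient of 0.5 * ||x||^2
    bias = 1.0 / np.sqrt(batch_size)                 # estimation error controlled by batch size
    noise = rng.normal(0.0, 1.0 / np.sqrt(batch_size), size=x.shape)
    return true_grad + bias + noise

def sgd_with_biased_oracle(x0, steps, step_size, batch_size, seed=0):
    """Plain SGD driven by the biased oracle; returns the last iterate."""
    rng = np.random.default_rng(seed)
    x = np.array(x0, dtype=float)
    for _ in range(steps):
        g = biased_gradient_oracle(x, batch_size, rng)
        x = x - step_size * g
    return x

# Larger batch sizes reduce the oracle's estimation error, so the last
# iterate lands closer to the minimizer (the origin for this stand-in f).
print(sgd_with_biased_oracle(x0=[5.0, -3.0], steps=200, step_size=0.1, batch_size=1))
print(sgd_with_biased_oracle(x0=[5.0, -3.0], steps=200, step_size=0.1, batch_size=100))
```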
Location
- Kanpur, Uttar Pradesh
- India 208016
Speakers
Prashanth L.A.
Biography:
Prashanth L.A. is an Assistant Professor in the Department of Computer Science and Engineering at Indian Institute of Technology Madras. Prior to this, he was a postdoctoral researcher at the Institute for Systems Research, University of Maryland - College Park from 2015 to 2017 and at INRIA Lille - Team SequeL from 2012 to 2014. From 2002 to 2009, he was with Texas Instruments (India) Pvt Ltd, Bangalore, India.
He received his Master's and Ph.D. degrees in Computer Science and Automation from the Indian Institute of Science, in 2008 and 2013, respectively. He was awarded the third prize for his Ph.D. dissertation by the IEEE Intelligent Transportation Systems Society (ITSS). He is the coauthor of a book entitled 'Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods', published by Springer in 2013. His research interests are in reinforcement learning, simulation optimization and multi-armed bandits, with applications in transportation systems, wireless networks and recommendation systems.
Address: India