Reinforcement Learning Posts - Page 2 of 2

RoboTurk: Human Reasoning and Dexterity for Large-Scale Dataset Creation

We built a system for collecting large-scale robot manipulation datasets with human supervision and used it to gather the largest robot dataset ever collected via teleoperation.

AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers

Andrey Kurenkov and Ajay Mandlekar

Presenting AC-Teach, a unifying approach that leverages advice from an ensemble of suboptimal teachers to accelerate the learning of actor-critic reinforcement learning agents.

Policy Certificates and Minimax-Optimal PAC Bounds for Episodic Reinforcement Learning

Christoph Dann

Introducing a new method for episodic reinforcement learning that achieves minimax-optimal probably approximately correct (PAC) and regret bounds, matching the statistical worst-case lower bounds in the dominating terms.

Progress Toward Safe and Reliable AI

Steve Eglash

An overview of research at SAIL on new techniques for looking inside the black box of neural networks, finding and removing bias, and assuring safety in autonomous systems.