The Stanford AI Lab Blog
About
Posts
All
Conferences
Computer Vision
Robotics
NLP
Machine Learning
Reinforcement Learning
Subscribe
SAIL
Robotics Posts - Page 3
Batch-Active Preference-Based Learning of Reward Functions
Erdem Bıyık
Efficient reward learning is hard. With a focus on preference-based learning methods, we show how sample-efficiency can be achieved along with computational efficiency by using batch-active methods.
Continue reading
Prev