Where do the rewards for robotic reinforcement learning come from? In this blog post, we study how crowdsourced language annotations and videos of humans can be used to learn reward functions scalably and help them generalize more broadly.
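To make the idea concrete, here is a minimal sketch of a language-conditioned reward: score how well an observation matches a natural-language instruction, and hand that score to the RL agent. The random-projection "encoders" below are stand-ins rather than the post's trained models, and all names and dimensions are illustrative assumptions.

```python
# Sketch only: a language-conditioned reward as the cosine similarity
# between an instruction embedding and an observation embedding.
# The encoders are random stand-ins; in practice they would be trained
# on crowdsourced (video, annotation) pairs. Dimensions are arbitrary.
import numpy as np

rng = np.random.default_rng(0)
D_OBS, D_TXT, D_EMB = 64, 32, 16          # toy feature sizes (assumptions)
W_obs = rng.normal(size=(D_OBS, D_EMB))   # stand-in visual encoder
W_txt = rng.normal(size=(D_TXT, D_EMB))   # stand-in language encoder

def embed(x, W):
    z = x @ W
    return z / (np.linalg.norm(z) + 1e-8)  # unit-normalize

def language_reward(frame_feats, instruction_feats):
    """Scalar reward: how well the frame matches the instruction."""
    return float(embed(frame_feats, W_obs) @ embed(instruction_feats, W_txt))

frame = rng.normal(size=D_OBS)        # stand-in for video-frame features
instr = rng.normal(size=D_TXT)        # stand-in for an annotation embedding
print(language_reward(frame, instr))  # the RL agent maximizes this signal
```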
We present an almost-linear time algorithm for the k-medoids problem that matches the prior state of the art in clustering quality. Our solution runs in almost the same time as k-means while keeping the advantages of k-medoids: cluster centers are actual data points, and arbitrary distance metrics are supported.
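For reference, here is a minimal sketch of the objective being solved, using the classic alternating heuristic rather than the post's almost-linear algorithm; this naive version does O(n²) work per iteration, and the function name and parameters are ours.

```python
# Reference implementation of the k-medoids objective, NOT the post's
# almost-linear algorithm: Voronoi-style alternation with O(n^2) work.
import numpy as np

def kmedoids_naive(X, k, n_iter=20, seed=0):
    rng = np.random.default_rng(seed)
    D = np.linalg.norm(X[:, None] - X[None, :], axis=-1)  # pairwise distances
    medoids = rng.choice(len(X), size=k, replace=False)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)          # assignment step
        new = medoids.copy()
        for c in range(k):                                 # update step: the
            members = np.flatnonzero(labels == c)          # member minimizing
            if members.size == 0:
                continue
            within = D[np.ix_(members, members)]           # total in-cluster
            new[c] = members[within.sum(axis=0).argmin()]  # distance wins
        if np.array_equal(new, medoids):
            break                                          # converged
        medoids = new
    return medoids, labels

X = np.random.default_rng(1).normal(size=(200, 2))
medoids, labels = kmedoids_naive(X, k=3)
print(X[medoids])  # centers are actual data points, unlike k-means
```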
We show that selective classification, where models are allowed to abstain when they are uncertain, can fail to improve, and can even hurt, accuracy on certain subpopulations of the data.
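As a rough illustration of the metrics involved, here is a sketch on synthetic data: abstain below a confidence threshold, then measure coverage and selective accuracy per subgroup. Everything here is a toy assumption rather than the paper's experimental setup; the data is contrived so that one group is overconfident on its mistakes.

```python
# Toy illustration of selective-classification metrics, not the paper's
# setup: abstain below a confidence threshold tau, then compare coverage
# and selective accuracy across two synthetic subgroups. Group 1 is
# deliberately made overconfident on its mistakes.
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
group = rng.integers(0, 2, size=n)
labels = rng.integers(0, 2, size=n)
correct = rng.uniform(size=n) < 0.8                  # 80% base accuracy
preds = np.where(correct, labels, 1 - labels)
conf = np.where(correct, rng.uniform(0.7, 1.0, n), rng.uniform(0.5, 0.8, n))
conf = np.where((group == 1) & ~correct,             # group 1: confident
                rng.uniform(0.85, 1.0, n), conf)     # even when wrong

def selective_metrics(mask, tau=0.85):
    keep = (conf >= tau) & mask                      # answered examples only
    return keep.sum() / mask.sum(), (preds[keep] == labels[keep]).mean()

for g in (0, 1):
    cov, acc = selective_metrics(group == g)
    print(f"group {g}: coverage={cov:.2f}, selective accuracy={acc:.2f}")
# Group 0's accuracy rises under abstention; group 1's falls below its base.
```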
By tapping into knowledge stored explicitly in text corpora, retrieval helps tackle the inefficiency, opaqueness, and static nature of large language models.
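As a minimal sketch of the retrieval idea (illustrative only, not any particular system): rank a passage corpus against the query and prepend the best match to the prompt, so the model can consult text that lives outside its weights. Bag-of-words vectors stand in for a learned dense retriever, and the corpus and names below are ours.

```python
# Sketch of retrieval-augmented generation (illustrative, not a specific
# system): score a small passage corpus against the query and prepend the
# top hit to the prompt. Bag-of-words vectors stand in for a dense retriever.
import numpy as np

corpus = [
    "The Eiffel Tower is located in Paris.",
    "k-medoids picks actual data points as cluster centers.",
    "Retrieval lets a model consult an external, updatable corpus.",
]

def toks(text):
    return text.lower().replace(".", " ").replace("?", " ").split()

vocab = sorted({w for doc in corpus for w in toks(doc)})

def embed(text):
    v = np.array([toks(text).count(w) for w in vocab], dtype=float)
    return v / (np.linalg.norm(v) + 1e-8)           # unit-normalize counts

def retrieve(query, k=1):
    scores = [embed(doc) @ embed(query) for doc in corpus]
    return [corpus[i] for i in np.argsort(scores)[::-1][:k]]

query = "Where is the Eiffel Tower?"
prompt = "\n".join(retrieve(query)) + f"\n\nQ: {query}\nA:"
print(prompt)  # the LM answers from retrieved, updatable text
```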