All Posts - Page 3

Demystifying Verbatim Memorization in Large Language Models

Jing Huang, Diyi Yang, Christopher Potts

How do LLMs memorize long sequences of text verbatim? In this work, we show that verbatim memorization is intertwined with the LM’s general capabilities.

Stanford AI Lab Papers and Talks at NAACL 2025

Compiled by Nitya Thakkar

All the great work from the Stanford AI Lab accepted at NAACL, all in one place.

Stanford AI Lab Papers and Talks at ICLR 2025

Compiled by Megha Srivastava

All the great work from the Stanford AI Lab accepted at ICLR 2025, all in one place.

MENTAT: A Clinician-Annotated Benchmark for Complex Psychiatric Decision-Making

Max Lamparth and Declan Grabb

We developed a new expert-designed and expert-annotated clinical decision-making dataset that also allows for nuanced accuracy and fairness evaluations with expert preferences, uncertainty, and soft labels.

Stanford AI Lab Graduates 2025

Compiled by Nikil Selvam, Alex Nam, and Judy Hanwen Shen

A list of our excellent students graduating roughly in summer 2025.

PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action

Yijia Shao and Diyi Yang

Having an agent handle tasks for you is cool. But does your language model agent respect privacy norms?

Productive Struggle: The Future of Human Learning in the Age of AI

Rose E. Wang and Megha Srivastava

What happens to human learning when superhuman intelligence is as accessible as a Google search?

MiniVLA: A Better VLA with a Smaller Footprint

Suneel Belkhale and Dorsa Sadigh

Reducing OpenVLA's parameter count 7x and improving the input and output representation spaces.

Stanford AI Lab Papers and Talks at NeurIPS 2024

Compiled by Ruhana Azam

All the great work from the Stanford AI Lab accepted at NeurIPS, all in one place.

Unintended Impacts of Alignment on Global Representation

Michael J. Ryan, William Held, Diyi Yang

How language model alignment impacts performance along three axes of global representation.
