Stanford AI Lab Papers and Talks at ACL 2023

July 12, 2023

The 61st Annual Meeting of the Association for Computational Linguistics (ACL) 2023 is being hosted in Toronto, Canada on July 9th - 14th. We’re excited to share all the work from SAIL that’s being presented, and you’ll find links to papers, videos and blogs below. Feel free to reach out to the contact authors directly to learn more about the work that’s happening at Stanford!

List of Accepted Papers

Main Conference

Grokking of Hierarchical Structure in Vanilla Transformers

Authors: Shikhar Murty, Pratyusha Sharma, Jacob Andreas, Christopher Manning
Contact: shikhar.murty@gmail.com
Keywords: emergent syntactic structure, grokking, transformer interpretability, generalization

Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation

Authors: Martijn Bartelds, Nay San, Bradley McDonnell, Dan Jurafsky, Martijn Wieling
Contact: m.bartelds@rug.nl
Links: Paper | Video
Keywords: asr, data augmentation, language variants, low-resource languages, self-training, speech recognition, tts

Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models

Authors: Myra Cheng, Esin Durmus, Dan Jurafsky
Contact: myra1@stanford.edu
Award nominations: Nominated for best paper awards
Links: Paper
Keywords: bias in language models, stereotypes, markedness, personas, intersectionality, unsupervised, prompting

Neural Machine Translation for Mathematical Formulae

Authors: Felix Petersen, Moritz Schubotz, Andre Greiner-Petter, Bela Gipp
Contact: felixp@stanford.edu
Links: Paper | Video
Keywords: formula translation, equations, special functions, mathematical language processing, content language, presentation language, latex, semantic latex, mathematica

On Second Thought, Let’s Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning

Authors: Omar Shaikh, Hongxin Zhang, William Held, Michael Bernstein, Diyi Yang
Contact: oshaikh@stanford.edu
Links: Paper
Keywords: cot, bias, social nlp, reasoning

ACL Findings

Modeling Cross-Cultural Pragmatic Inference with Codenames Duet

Authors: Omar Shaikh, Caleb Ziems, William Held, Aryan J. Pariani, Fred Morstatter, Diyi Yang
Contact: oshaikh@stanford.edu
Links: Paper
Keywords: common ground, pragmatic inference, social nlp, games!

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models

Authors: Yuhui Zhang*, Michihiro Yasunaga*, Zhengping Zhou*, Jeff Z. HaoChen*, James Zou, Percy Liang, Serena Yeung
Contact: yuhuiz@stanford.edu
Links: Paper | Video | Website
Keywords: scaling, language model, negation

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

Authors: Mirac Suzgun, Nathan Scales, Nathanael Schärli, Sebastian Gehrmann, Yi Tay, Hyung Won Chung, Aakanksha Chowdhery, Quoc Le, Ed Chi, Denny Zhou, Jason Wei
Contact: msuzgun@cs.stanford.edu
Links: Paper | Website
Keywords: big bench hard, bbh, chain-of-thought, codex, palm, reasoning

Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding

Authors: Mirac Suzgun, Luke Melas-Kyriazi, Dan Jurafsky
Contact: msuzgun@cs.stanford.edu
Links: Paper | Website
Keywords: minimum bayes risk decoding, wisdom of the crowd, crowd sampling, open-ended generation, majority voting

Workshops

SIGHT: A Large Annotated Dataset on Student Insights Gathered from Higher Education Transcripts

Authors: Rose E. Wang*, Pawan Wirawarn*, Noah Goodman, Dorottya (Dora) Demszky
Contact: rewang@cs.stanford.edu
Links: Paper
Keywords: education, nlp, dataset
Workshop: Proceedings of Innovative Use of NLP for Building Educational Applications

Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction

Authors: Rose E. Wang, Dorottya (Dora) Demszky
Contact: rewang@cs.stanford.edu
Links: Paper | Video
Keywords: education, nlp, coaching
Workshop: Proceedings of Innovative Use of NLP for Building Educational Applications

BudgetLongformer: Can we Cheaply Pretrain a SotA Legal Language Model From Scratch?

Authors: Joel Niklaus, Daniele Giofré
Contact: jniklaus@stanford.edu
Links: Paper
Keywords: legal, efficient, pretraining, longformer, electra, billsum, pubmed, summarization
Workshop: SustaiNLP

We look forward to seeing you at ACL!

Keep on top of the latest SAIL Blog posts via , , or email: