The Conference on Computer Vision and Pattern Recognition (CVPR) 2021 is being hosted virtually from June 19th - June 25th. We’re excited to share all the work from SAIL that’s being presented, and you’ll find links to papers, videos and blogs below. Feel free to reach out to the contact authors directly to learn more about the work that’s happening at Stanford!
List of Accepted Papers
GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving
Authors: Yun Chen*, Frieda Rong*, Shivam Duggal*, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun
Contact: chenyuntc@gmail.com
Award nominations: Oral, Best Paper Finalist
Links: Paper | Video | Website
Keywords: computer vision, simulation, image simulation, video simulation, self-driving, autonomous driving, 3d vision, computer graphics, robotics
Greedy hierarchical variational autoencoders for large-scale video prediction
Authors: Bohan Wu, Suraj Nair, Roberto Martin-Martin, Li Fei-Fei*, Chelsea Finn*
Contact: bohanwu@stanford.edu
Keywords: variational autoencoders, video prediction
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning
Authors: Madeleine Grunde-McLaughlin
Contact: mgrund@sas.upenn.edu
Links: Paper | Video | Website
Keywords: visual question answering, compositionality, computer vision, benchmark
ArtEmis: Affective Language for Visual Art
Authors: Panos Achlioptas, Maks Ovsjanikov, Kilichbek Haydarov, Mohamed Elhoseiny, Leonidas Guibas
Contact: panos@cs.stanford.edu
Award nominations: Oral
Links: Paper | Video | Website
Keywords: affective-computing, wikiart, neural-speakers, emotions
DARCNN: Domain Adaptive Region-based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images
Authors: Joy Hsu, Wah Chiu, Serena Yeung
Contact: joycj@stanford.edu
Links: Paper | Website
Keywords: unsupervised domain adaptation, instance segmentation
Hierarchical Motion Understanding via Motion Programs
Authors: Sumith Kulal*, Jiayuan Mao*, Alex Aiken, Jiajun Wu
Contact: sumith@cs.stanford.edu
Links: Paper | Video | Website
Keywords: neuro-symbolic, motion, primitives, programs
Home Action Genome: Cooperative Compositional Action Understanding
Authors: Nishant Rai
Contact: nishantr018@gmail.com
Links: Paper | Website
Keywords: multi modal, multi camera view, multi perspective, action recognition, action localization, atomic actions, scene graphs, contrastive learning, audio-visual, large scale dataset
Joint Learning of 3D Shape Retrieval and Deformation
Authors: Mikaela Angelina Uy, Vladimir G. Kim, Minhyuk Sung, Noam Aigerman, Siddhartha Chaudhuri, Leonidas Guibas
Contact: mikacuy@stanford.edu
Links: Paper | Video | Website
Keywords: joint learning, retrieval, deformation
Metadata Normalization
Authors: Mandy Lu, Qingyu Zhao, Jiequan Zhang, Kilian M. Pohl, Li Fei-Fei, Juan Carlos Niebles, Ehsan Adeli
Contact: mlu@cs.stanford.edu
Links: Paper | Website
Keywords: metadata, normalization, bias, deep learning, bias-free feature learning
We look forward to seeing you at CVPR 2021!