An egocentric vision system is a framework built around a wearable camera that continuously captures the scene in front of the first person. In particular, I define an egocentric vision system as a framework that leverages different levels of first-person attention to identify the important objects and faces in the scene that contribute to the subject's activities. The first person's behavior, including where she looks (gaze) and what she does (hands manipulating objects), provides invaluable context for determining which objects capture her attention at any given time. My goal is to use these structured sources of first-person information to enable weakly supervised recognition of objects and activities.
I aim to develop action recognition techniques that rely on semantically meaningful features capturing the interactions of objects with one another. This is in contrast to state-of-the-art techniques based on space-time interest points or point trajectories. Many actions involve similar dynamics and hand-object relationships but differ in their purpose and meaning. The key to differentiating such actions is the ability to identify how they change the state of objects and materials in the environment.
Segmentation is one of the most fundamental problems in computer vision. If segmentation were solved, many of the field's major challenges would become far more tractable.