Like Egocentric Vision on Facebook

Alireza Fathi


Postdoctoral Scholar

Gates Building, Office 244

Vision Lab
Department of Computer Science
Stanford University

Email:  alireza@cs.stanford.edu

C.V. | Ph.D Dissertation | Datasets/Software | Publications | Invited Talks | Services | Courses | Useful Links | Experience | Fun! | My Sister
 

Jul 13-Present Aug 08-Jun 13 May 11-Aug 11 May 09-Aug 09 Aug 06-Jul 08 May 07-Aug 07 Aug 02-Jul 06

I am currently a postdoctoral scholar at Stanford University, working with FeiFei Li. I received my Ph.D from Georgia Tech. My thesis was about Learning Descriptive Models of Objects and Activities from Egocentric Video, and my adviser was Jim Rehg. I have had the chance to closely work with many other great people in the past: Greg Mori, Jessica Hodgins, Gregory Abowd, Frank Dellaert and John Krumm.

News

03/07/2014: Organizing a Workshop on Visual Perception of Object and Scene Affordances in Conjunction with ECCV 2014 with Karthik Mahesh Varadarajan.

03/07/2014: Submitted 3 papers to ECCV 2014 with Yuke, Serena, and Fei-Fei.

01/04/2014: Co-teaching the Advanced Reading in Computer Vision with FeiFei.

12/13/2013: With Kris Kitani, Yong Jae Lee and Michael Ryoo, we are organizing the third workshop on Egocentric Vision in conjunction with CVPR 2014. We are looking forward to your paper, abstract and poster submissions.

08/28/2013: Paper accepted to ICCV 2013, Learning to Predict Gaze in Egocentric Video (PDF, GTEA Gaze(+) Dataset).

08/09/2013: I have released some code for segmenting and recognizing objects, hands, and actions in egocentric video. Get it here.

05/03/2013: I defended on May 22nd, 2013. My thesis committee members were James Rehg, Martial Hebert, Antonio Torralba, Gregory Abowd, Aaron Bobick and Thad Starner. Here is my Ph.D Dissertation , and here are my defense presentation slides (PPTX, PDF).

02/24/2013: Paper accepted to CVPR 2013, Modeling Actions through State Changes (PDF).

 

Publications

Google Scholar

Learning to Predict Gaze in Egocentric Video

Yin Li, Alireza Fathi, James M. Rehg

ICCV, 2013 (PDF)

Learning Descriptive Models of Objects and Activities from Egocentric Video

Alireza Fathi

Ph.D. Thesis, 2013 (PDF)

Modeling Actions through State Changes *

Alireza Fathi, James M. Rehg

CVPR, 2013 (PDF)

Learning to Recognize Daily Actions using Gaze

Alireza Fathi, Yin Li, James M. Rehg

ECCV, 2012 (PDF, Project Page)

Detecting Eye Contact using Wearable Eye-Tracking Glasses

Zhefan Ye, Yin Li, Alireza Fathi, Yi Han, Agata Rozga, Gergory D. Abowd, James M. Rehg

2nd Workshop on Pervasive Eye Tracking and Mobile Eye-based Interaction (in conjunction with UbiComp), 2012 (PDF)

Social Interactions: A First-Person Perspective

Alireza Fathi, Jessica K. Hodgins, James M. Rehg

CVPR, 2012 (PDF, Dataset)

Understanding Egocentric Activities *

Alireza Fathi, Ali Farhadi, James M. Rehg

ICCV, 2011 (PDF, Dataset)

Learning to Recognize Objects in Egocentric Activities *

Alireza Fathi, Xiaofeng Ren, James M. Rehg

CVPR, 2011 (PDF, Dataset)

Combining Self Training and Active Learning for Video Segmentation *

Alireza Fathi, Maria Florina Balcan, Xiaofeng Ren, James M. Rehg

BMVC, 2011 (PDF, Abstract, Software)

Detecting Road Intersections from GPS Traces

Alireza Fathi, John Krumm

GIScience, 2010 (PDF)

Action Recognition by Learning Mid-Level Motion Features

Alireza Fathi, Greg Mori

CVPR, 2008 (PDF, Bibtex)

Human Pose Estimation using Motion Exemplars

Alireza Fathi, Greg Mori

ICCV, 2007 (PDF, Bibtex, More Information, Slides, Course Project that led to this paper)

Voice Synthesis using the Generalized Pressure-Controlled Valve

Tamara Smyth, Alireza Fathi

International Computer Music Conference (ICMC), 2008 (PDF)

A Standard Workflow for Illumination-Invariant Image Extraction

Mark S. Drew, Muntaseer Salahuddin, Alireza Fathi

15th Color and Imaging Conference, 2007 (PDF)

EasySLAM

Alireza Fathi, Alex Cunninghum, Balmanohar Paluri, Kai Ni and Frank Dellaert

GVU Technical Report (GIT-GVU-10-03), 2010. (Link)

Local Exponential Maps: Towards Massively Distributed Multi-Robot Mapping

Frank Dellaert, Alireza Fathi, Alex Cunninghum, Balmanohar Paluri, Kai Ni

GVU Technical Report(GIT-GVU-10-04), 2010. (Link)

Poseidon Team Description Paper

Nasrin Mostafazadeh, Saba Ardeshiri, Sepideh Movaghati, Shadi Hariri, Zeinab Jahanzad, Alireza Fathi, Majid Valipour

Ranked 2nd in Rescue Simulation League, Robocup 2006, Bremen, Germany (PDF)

Impossibles Sony Aibo 4-Legged RoboCup Technical report

Saman Aliari Zonouz, Hamid Reza Vaezi Joze, Siavash Rahbar, Majid Valipour, Alireza Fathi

RoboCup 2006, Bremen, Germany. (PDF)

Impossibles Sony Aibo 4-Legged RoboCup Team Description Paper

Hamid Reza Vaezi Joze, Saman Aliari Zonouz, Siavash Rahbar, Majid Valipour, Alireza Fathi

RoboCup 2006, Bremen, Germany. (PDF)

Impossibles Team Description Paper

Jafar Habibi, Alireza Fathi, Saeed Hassanpour, Mohammad Reza Ghodsi, Behzad Sadjadi, Hamid Reza Vaezi, Majid Valipour

Ranked 1st in Rescue Similation League, RoboCup 2005, Osaka, Japan (PDF)

* These works were partially funded by the Intel Science and Technology Center for Pervasive Computing.
 
Invited Talks

Social Interactions: A First-Person Perspective

Predicting Attention and Action in Egocentric Video

Learning Descriptive Models of Objects and Activities from Egocentric Video


 
Datasets/Software
 
Interactive Image Segmentation Toolbox
 
 
Egocentric Vision Toolbox
 
 

GTEA Gaze(+)

 
 

Social Interactions at Disney parks

 
 

Georgia Tech Egocentric Activities (GTEA)

 
Projects

Egocentric (First-Person) Vision: An egocentric vision system, is a framework consisting of a wearable camera that continuoulsy captures the scene in front of the first-person. In particular, I define an egocentric vision system as a framework that leverages different levels of first-person attention to identify important objects and faces in the scene that contribute to subject's activities. First-person's attitude, including where she looks (gaze) and what she does (hands manipulating objects) provide an invaluable context for determining the objects that grab her attention at any given time. Our goal is to use these structured sources of information coming from first-person in order to enable weakly supervised recognition of objects and activities.

  • Alireza Fathi, Yin Li, James M. Rehg, Learning to Recognize Daily Actions using Gaze, ECCV, 2012. (PDF, GTEA Gaze(+) Dataset)

  • Alireza Fathi, Jessica K. Hodgins, James M. Rehg, Social Interactions: A First-Person Perspective, CVPR, 2012. (PDF, Dataset)

  • Alireza Fathi, Ali Farhadi, James M. Rehg, Understanding Egocentric Activities, ICCV, 2011. (PDF, Dataset)

  • Alireza Fathi, Xiaofeng Ren, James M. Rehg, Learning to Recognize Objects in Egocentric Activities, CVPR, 2011. (PDF, Dataset)



    Video and Image Segmentation: I believe that segmentation is probably the most fundamental problem in computer vision. If segmentation is solved, many of the big challenges in the field become trivial.

  • Alireza Fathi, Maria Florina Balcan, Xiaofeng Ren, James M. Rehg, Combining Self Training and Active Learning for Video Segmentation, BMVC, 2011 (PDF, Abstract, Software).



    Action Recognition (ICCV07, CVPR08, ICCV11, CVPR12, MSc Thesis): I aim at developing action recognition techniques that rely on semantically meaningful features which capture interaction of objects with each other. This is in contrast to state of the art techniques that are based on space-time interest points or point trajectories.

  • Alireza Fathi, Greg Mori, Action Recognition by Learning Mid-level Motion Features, CVPR, 2008. (PDF, Bibtex)

    Human Pose Estimation:

  • Alireza Fathi and Greg Mori, Human Pose Estimation using Motion Exemplars, ICCV, 2007. (PDF, Bibtex, More Information, Slides, Course Project that led to this paper)

  • MSc Thesis: Alireza Fathi, Human Figure Tracking using Motion Exemplars, Department of Computing Science, Simon Fraser University, 2008. (PDF)
     
     
     

    Localization and Mapping:

  • Helped in developing GTSAM as part of Frank Dellaert's team.

  • Alireza Fathi, John Krumm, Detecting Road Intersections from GPS Traces, GIScience, 2010. (PDF)

  • Alireza Fathi, Alex Cunninghum, Balmanohar Paluri, Kai Ni and Frank Dellaert, EasySLAM, GVU Technical Report(GIT-GVU-10-03), 2010. (Link)

  • Frank Dellaert, Alireza Fathi, Alex Cunninghum, Balmanohar Paluri and Kai Ni, Local Exponential Maps: Towards Massively Distributed Multi-robot Mapping, GVU Technical Report(GIT-GVU-10-04), 2010. (Link)
     

     

     

    Color Constancy and Illumination Invariance:

  • Mark S. Drew, Muntaseer Salahuddin, Alireza Fathi, A Standardized Workflow for Illumination-Invariant Image Extraction, 15th Color Imaging Conference, New Mexico, 2007. (PDF)
     

    RoboCup:

     
  • Nasrin Mostafazadeh, Saba Ardeshiri, Sepideh Movaghati, Shadi Hariri, Zeinab Jahanzad, Alireza Fathi, Majid Valipour, Poseidon Team Description Paper, RoboCup 2006, Bremen, Germany. (PDF)

  • Saman Aliari Zonouz, Hamid Reza Vaezi Joze, Siavash Rahbar, Majid Valipour, Alireza Fathi, Impossibles Sony Aibo 4-Legged RoboCup Technical report, RoboCup 2006, Bremen, Germany. (PDF)

  • Hamid Reza Vaezi Joze, Saman Aliari Zonouz, Siavash Rahbar, Majid Valipour, Alireza Fathi, Impossibles Sony Aibo 4-Legged Team Description Paper, RoboCup 2006, Bremen, Germany. (PDF)

  • Jafar Habibi, Alireza Fathi, Saeed Hassanpour, Mohammad Reza Ghodsi, Behzad Sadjadi, Hamid Reza Vaezi, Majid Valipour, Impossibles Team Description Paper, RoboCup 2005, Osaka, Japan. (PDF)

     

     

    Other:

  • Tamara Smyth, Alireza Fathi, Voice Synthesis using the Generalized Pressure-Controlled Valve, International Computer Music Conference (ICMC), 2008. (PDF)
     
  • BSc Thesis: Alireza Fathi, Assembler and Simulator for IBM 360/370, Computer Engineering Department, Sharif University of Technology, 2004. (Director: Dr. Hamid Sarbazi Azad) (Persian PDF)

     

     

     

    Courses

     

    Services
     
    Co-Organizer of 3rd IEEE Workshop on Egocentric (First-Person) Vision in Conjunction with CVPR 2014.
    Co-Organizer of 2nd IEEE Workshop on Egocentric (First-Person) Vision in Conjunction with CVPR 2012.
    Conference Reviewer of: CVPR, ICCV, ECCV, BMVC, NIPS, AAAI, IROS, UbiComp, Face and Gesture, UIST, ...
    Journal Reviewer of: PAMI, CVIU, TCSVT
    Program Committee of: Workshop on Wearable Computer Vision at ICCV 2013, SenseCam Conference 2013 at UCSD
     

    Useful Links

  • Notes on Graduate Studies, Alireza Fathi, 2010.
  • How to Buy a Used Car, Alireza Fathi, 2010.
  • Recent Hot Machine Learning Hammers used in Computer Vision, Alireza Fathi, 2011.
  • My Facebook App for Data Collection
  • Simultaneous Recovery of Shape, Motion and Grouping by Applying Rank Constraints, 2010. (PDF, Poster)



    Experience

    Teaching Experience:

    Research Experience:

    Work Experience:

    Languages:

     

    Fun

    This video is seen by 20,000 people by now (Nov 2010)

     

     

    website statistics