GTEA Gaze+

GTEA Gaze+ Dataset

We are collecting this dataset using SMI eye-tracking glasses. Collection is more than halfway complete, and the data collected and annotated so far is available here.

The dataset is being collected at Georgia Tech's AwareHome. It consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities by following the given cooking recipes (get the recipes here).
The activities are: American Breakfast, Pizza, Snack, Greek Salad, Pasta Salad, Turkey Sandwich and Cheese Burger. The SMI glasses record an HD video of the subjects' activities at 24 frames per second. They also record the subjects' gaze at 30 fps. This dataset contains hundreds of objects.
For each activity, we have used ELAN to annotate its actions. An activity is a long meal-preparation task such as making pizza, and an action is a short meaningful temporal segment such as putting sauce on the pizza crust, dicing the green peppers, distributing sausages on the pizza crust, washing the mushrooms, etc.
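
Because the video is recorded at 24 fps and the gaze at 30 fps, gaze samples and video frames do not line up one-to-one. The following is only a minimal sketch of one way to pair them, assuming that both streams start at the same instant and that both rates are exactly constant; neither assumption is stated in the dataset description, and the function names are hypothetical rather than part of any tool shipped with the dataset.

```python
VIDEO_FPS = 24  # video frame rate given in the dataset description
GAZE_FPS = 30   # gaze sampling rate given in the dataset description


def gaze_index_to_frame(gaze_index: int) -> int:
    """Map a 0-based gaze sample index to a 0-based video frame index.

    Assumption: sample 0 and frame 0 share the same timestamp.
    The sample's time is gaze_index / GAZE_FPS seconds, and the frame
    on screen at that time is floor(time * VIDEO_FPS). Integer
    arithmetic avoids floating-point rounding.
    """
    return (gaze_index * VIDEO_FPS) // GAZE_FPS


def group_gaze_by_frame(num_gaze_samples: int) -> dict[int, list[int]]:
    """Group gaze sample indices by the video frame they fall on."""
    frames: dict[int, list[int]] = {}
    for g in range(num_gaze_samples):
        frames.setdefault(gaze_index_to_frame(g), []).append(g)
    return frames


if __name__ == "__main__":
    # Every 5 gaze samples cover 4 video frames, so one frame in
    # each group of four receives two gaze samples.
    print(group_gaze_by_frame(5))
```

If the gaze files carry their own timestamps or frame numbers, those should take precedence over this fixed-rate mapping.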

Download The Dataset

The table below lists the data available for each activity performed by each subject: Egocentric Video (V), Audio (A), Gaze (G), and Action Annotations (N).

Subject/Activity	American Breakfast	Pizza (Special)	Afternoon Snack	Greek Salad	Pasta Salad	Turkey Sandwich	Cheese Burger	Download all subject data
Yin	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N		V, A, G, N	V, A, G, N
Alireza	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N
Carlos	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N
Rahul	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N	V, A, G, N
Shaghayegh	V, A, G, N	V, A, G, N	V, A, G, N