GTEA Gaze+ Dataset


We are collecting this dataset using SMI eye-tracking glasses. We are more than half-way through, and here we have made the collected and annotated data available. 




We are collecting this dataset at Georgia Tech's AwareHome. This dataset consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities based on the given cooking recipes (get the recipes here).
Activities are: American Breakfast, Pizza, Snack, Greek Salad, Pasta Salad, Turkey Sandwich and Cheese Burger. SMI glasses record a HD video of subjects activities at 24 frames per second. They also record subject's gaze at 30 fps. This dataset contains more than hundreds of objects.
For each activity, we have used ELAN to annotate its actions. An activity is a long meal-preparation task such as making pizza, and an action is a short meaningful temporal segment such putting sauce on the pizza crust, dicing the green peppers, distributing saussages on the pizza crust, washing the mushrooms, etc.

Download The Dataset


The table below contains the following data for each activity performed by each subject: Egocentric Video (V), Audio (A), Gaze (G), Action Annotations (N)

Subject/Activity

American Breakfast

Pizza (Special)

Afternoon Snack

Greek Salad

Pasta Salad

Turkey Sandwich

Cheese Burger

Download all subject data

Yin

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

Alireza

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

Carlos

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

Rahul

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

V, A, G, N

Shaghayegh

V, A, G, N

V, A, G, N

V, A, G, N