Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task

Jason Li; Nicholas Watters; Hansem Sohn; Mehrdad Jazayeri

Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task

Jason Li, Nicholas Watters, Hansem Sohn, Mehrdad Jazayeri

Proceedings of The 1st Gaze Meets ML workshop, PMLR 210:98-112, 2023.

Abstract

From smoothly pursuing moving objects to rapidly shifting gazes during visual search, humans employ a wide variety of eye movement strategies in different contexts. While eye movements provide a rich window into mental processes, building generative models of eye movements is notoriously difficult, and to date the computational objectives guiding eye movements remain largely a mystery. In this work, we tackled these problems in the context of a canonical spatial planning task, maze-solving. We collected eye movement data from human subjects and built deep generative models of eye movements using a novel differentiable architecture for gaze fixations and gaze shifts. We found that human eye movements are best predicted by a model that is optimized not to perform the task as efficiently as possible but instead to run an internal simulation of an object traversing the maze. This not only provides a generative model of eye movements in this task but also suggests a computational theory for how humans solve the task, namely that humans use mental simulation.

Cite this Paper

BibTeX


@InProceedings{pmlr-v210-li23a,
  title = 	 {Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task},
  author =       {Li, Jason and Watters, Nicholas and Sohn, Hansem and Jazayeri, Mehrdad},
  booktitle = 	 {Proceedings of The 1st Gaze Meets ML workshop},
  pages = 	 {98--112},
  year = 	 {2023},
  editor = 	 {Lourentzou, Ismini and Wu, Joy and Kashyap, Satyananda and Karargyris, Alexandros and Celi, Leo Anthony and Kawas, Ban and Talathi, Sachin},
  volume = 	 {210},
  series = 	 {Proceedings of Machine Learning Research},
  month = 	 {03 Dec},
  publisher =    {PMLR},
  pdf = 	 {https://proceedings.mlr.press/v210/li23a/li23a.pdf},
  url = 	 {https://proceedings.mlr.press/v210/li23a.html},
  abstract = 	 {From smoothly pursuing moving objects to rapidly shifting gazes during visual search, humans employ a wide variety of eye movement strategies in different contexts. While eye movements provide a rich window into mental processes, building generative models of eye movements is notoriously difficult, and to date the computational objectives guiding eye movements remain largely a mystery. In this work, we tackled these problems in the context of a canonical spatial planning task, maze-solving. We collected eye movement data from human subjects and built deep generative models of eye movements using a novel differentiable architecture for gaze fixations and gaze shifts. We found that human eye movements are best predicted by a model that is optimized not to perform the task as efficiently as possible but instead to run an internal simulation of an object traversing the maze. This not only provides a generative model of eye movements in this task but also suggests a computational theory for how humans solve the task, namely that humans use mental simulation.}
}

Endnote

%0 Conference Paper
%T Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task
%A Jason Li
%A Nicholas Watters
%A Hansem Sohn
%A Mehrdad Jazayeri
%B Proceedings of The 1st Gaze Meets ML workshop
%C Proceedings of Machine Learning Research
%D 2023
%E Ismini Lourentzou
%E Joy Wu
%E Satyananda Kashyap
%E Alexandros Karargyris
%E Leo Anthony Celi
%E Ban Kawas
%E Sachin Talathi	
%F pmlr-v210-li23a
%I PMLR
%P 98--112
%U https://proceedings.mlr.press/v210/li23a.html
%V 210
%X From smoothly pursuing moving objects to rapidly shifting gazes during visual search, humans employ a wide variety of eye movement strategies in different contexts. While eye movements provide a rich window into mental processes, building generative models of eye movements is notoriously difficult, and to date the computational objectives guiding eye movements remain largely a mystery. In this work, we tackled these problems in the context of a canonical spatial planning task, maze-solving. We collected eye movement data from human subjects and built deep generative models of eye movements using a novel differentiable architecture for gaze fixations and gaze shifts. We found that human eye movements are best predicted by a model that is optimized not to perform the task as efficiently as possible but instead to run an internal simulation of an object traversing the maze. This not only provides a generative model of eye movements in this task but also suggests a computational theory for how humans solve the task, namely that humans use mental simulation.

APA


Li, J., Watters, N., Sohn, H. & Jazayeri, M.. (2023). Modeling Human Eye Movements with Neural Networks in a Maze-Solving Task. Proceedings of The 1st Gaze Meets ML workshop, in Proceedings of Machine Learning Research 210:98-112 Available from https://proceedings.mlr.press/v210/li23a.html.

Related Material

Download PDF