Gaze Estimation with an Ensemble of Four Architectures

Abstract

This paper presents a method for gaze estimation according to face images. We train several gaze estimators adopting four different network architectures, including an architecture designed for gaze estimation (i.e.,iTracker-MHSA) and three originally designed for general computer vision tasks(i.e., BoTNet, HRNet, ResNeSt). Then, we select the best six estimators and ensemble their predictions through a linear combination. The method ranks the first on the leader-board of ETH-XGaze Competition.

Publication
Technical report for ETH-XGaze Challenge @CVPR2021
Xin Cai
Xin Cai
Master student(co-adviced)
Jiabei Zeng
Jiabei Zeng
Associate Professor
Yunjia Sun
Yunjia Sun
Ph.D. student (co-adviced)
Xilin Chen
Xilin Chen
Professor
Shiguang Shan
Shiguang Shan
Professor