Different Paradigms from Computer Vision Align with Human Assessment of the Mouse Grimace Scale

Reimann, M.; Aloui, J.; Obländer, N.; Andresen, N.; Hohlbaum, K.; Hellwich, O.; Reiske, P.

2026-03-06 animal behavior and cognition

10.64898/2026.03.04.709492 bioRxiv

Show abstract

Animal welfare is a central aspect in animal-based research where mice are most commonly used. Their facial expression can be analyzed to assess their well-being status using the Mouse Grimace Scale. However, its manual application becomes increasingly impractical when used on a large number of animals. This lead to the ongoing integration of computer vision methods to automate the analysis. While such methods have proven effective qualitatively, a systematic assessment to verify their reliability largely remains an open research gap. In this work, we attempted to close this gap as we evaluated three dominant paradigms (i.e., classification from supervised learning features, self-supervised learning features, or landmark locations) for the binary (i.e., well-being un-/impaired) classification of facial mouse images. Our quantitative results showed that such methods can be employed successfully with as low as 16% type II error rates. For qualitative assessment, we visualized the decision-making process and demonstrated that mainly pixels associated with the mouse rather than its environment are used. We further discovered that visual characteristics of the mice beyond those described by the Mouse Grimace Scale contributed to the classification. Our work showed that the automated well-being status assessment in mice is trustworthy and urges towards widespread adoption.

Different Paradigms from Computer Vision Align with Human Assessment of the Mouse Grimace Scale

Matching journals