Adoption of MMPose, a general purpose pose estimation library, for animal tracking
Choi, J. D.; Kumar, V.
Markerless pose estimation has emerged as a powerful technique for quantifying animal behavior, capable of high-resolution tracking of body parts. Many neuroscience labs rely on tools like DeepLabCut and SLEAP, which provide accessible interfaces but restrict users to a narrow set of models and configurations. In this work, we adopt MMPose, an open-source, general-purpose computer vision library, to build a workflow for training and evaluating multiple state-of-the-art models on animal video datasets. We benchmark these models in two scenarios: (1) a complex maze assay with occlusions and varied backgrounds, and (2) a simpler open field arena with a high-contrast background. Our results show that a bottom-up model (DEKR) delivers the highest accuracy in the complex task, whereas lighter-weight models (e.g., SLEAP) offer superior speed, highlighting a clear trade-off between accuracy and throughput. We also evaluate a recently published foundation model (TopViewMouse-5K), trained on a large top-view mouse dataset, to test its generalization. It performs poorly on our tasks in the zero-shot setting, and even when we combine its data with our training set, we observe no consistent benefit. These findings emphasize the importance of context-specific model selection and the need for more diverse training data to create generalizable pose models. By leveraging a general-purpose vision library, researchers can flexibly choose the models that best suit their experimental needs. This work illustrates how adopting advanced computer vision frameworks can accelerate behavioral neuroscience and genetics research, paving the way for more scalable, reproducible, and sensitive analysis of animal behavior.