AutoViDev: A computer-vision framework to enhance and accelerate research in human development

Using AutoViDev offline mode for automatic detection of infant locomotion. (a) Researchers manually track infants in a video (left) to determine their path over 20 minutes of free-play. Figure on the right depicts one 20-minute path that was manually extracted. (b) AutoViDev can achieve similar tracking by detecting the infant’s feet and generating the 20-minute path. The AutoViDev tracked path significantly matched the manually digitized path.


Interdisciplinary exchange of ideas and tools can accelerate scientific progress. For example, findings from developmental and vision science have spurred recent advances in artificial intelligence and computer vision. However, relatively little attention has been paid to how artificial intelligence and computer vision can facilitate research in developmental science. The current study presents AutoViDev—an automatic video-analysis tool that uses machine learning and computer vision to support video-based developmental research. AutoViDev identifies full body position estimations in real-time video streams using convolutional pose machine-learning algorithms. AutoViDev provides valuable information about a variety of behaviors, including gaze direction, facial expressions, posture, locomotion, manual actions, and interactions with objects. We present a high-level architecture of the framework and describe two projects that demonstrate its usability. We discuss the benefits of applying AutoViDev to large-scale, shared video datasets and highlight how machine learning and computer vision can enhance and accelerate research in developmental science.

In Computer Vision Conference (CVC) 2019