Human vs. Machine Minds: Ego-Centric Action Recognition Compared


* These authors contributed equally
[1] University of Surrey [2] University of Newcastle
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025 Workshop on Multimodal Algorithmic Reasoning (MAR'25)

Explore the differences between human and AI action recognition in ego-centric videos.

Research pipeline for comparing human and AI performance

Our research pipeline outlines our approach to comparing human and AI performance in ego-centric video action recognition. We first employed a classifier to pre-select Easy and Hard video sets. To enable a comparison between how humans and AI models recognise activities in video, we artificially and systematically reduced each video's spatial resolution. Then, using human participants and an AI model as classifiers, we evaluated and compared performance on these spatially reduced videos to quantify the difference in recognition between humans and the AI model.
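As a concrete illustration of the reduction step, the sketch below crops each frame to a progressively smaller centred window and records a classifier's confidence in the true action at each stage. This is a minimal sketch rather than the exact procedure from the paper: the crop factor, the centred window, and the classifier callable are illustrative assumptions.

import numpy as np

def reduce_spatially(frames: np.ndarray, stage: int, factor: float = 0.8) -> np.ndarray:
    """Crop every frame to a centred window shrunk by `factor` per stage.

    frames: (T, H, W, C) video clip; stage 0 returns the original clip,
    and each further stage removes more surrounding spatial context.
    """
    t, h, w, c = frames.shape
    scale = factor ** stage
    ch, cw = max(1, int(h * scale)), max(1, int(w * scale))
    top, left = (h - ch) // 2, (w - cw) // 2
    return frames[:, top:top + ch, left:left + cw, :]

def confidence_per_stage(frames, true_label, classifier, n_stages=8):
    """Record the classifier's confidence in the ground-truth class at each
    reduction stage; `classifier` is assumed to map a clip to a dict of
    class probabilities."""
    return [classifier(reduce_spatially(frames, s)).get(true_label, 0.0)
            for s in range(n_stages)]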

Abstract: Human vs. Machine Action Recognition

Humans reliably surpass the performance of the most advanced AI models in action recognition, especially in real-world scenarios with low resolution, occlusions, and visual clutter. These models broadly resemble humans in using architectures that allow hierarchical feature extraction, yet they prioritise different features, leading to notable differences in their recognition. This study investigates these differences by introducing Epic ReduAct, a dataset derived from Epic-Kitchens-100. It consists of Easy and Hard ego-centric videos across various action classes. Critically, our dataset incorporates the concepts of Minimal Recognisable Configuration (MIRC) and sub-MIRC, derived by progressively reducing the spatial content of the action videos across multiple stages. This enables a controlled evaluation of recognition difficulty for humans and AI models. While humans, unlike AI models, demonstrate proficiency in recognising Hard videos, they experience a sharp decline in recognition ability as visual information is reduced, ultimately reaching a threshold beyond which recognition is no longer possible. In contrast, the AI models examined in this study appear more resilient in this specific context, with recognition confidence decreasing gradually or, in some cases, even increasing at later reduction stages. These findings suggest that the limitations observed in human recognition do not directly translate to AI models, highlighting the distinct nature of their processing mechanisms.
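Along a single reduction chain, the MIRC can be read off as the last stage that is still recognised and the sub-MIRC as the first following stage that is not. The sketch below shows this bookkeeping under an assumed recognition threshold of 0.5; the threshold value is an illustrative assumption, not one taken from the paper.

def find_mirc_pair(stage_scores, threshold=0.5):
    """stage_scores[i] is the recognition score at reduction stage i
    (stage 0 = full video, later stages = less spatial content).

    Returns (mirc_stage, sub_mirc_stage): the last stage still recognised
    above the threshold and the first following stage that is not.
    """
    for stage, score in enumerate(stage_scores):
        if score < threshold:
            if stage == 0:
                return None, None  # never recognised, so no MIRC exists
            return stage - 1, stage
    return None, None  # recognised at every stage, no sub-MIRC reached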

Want to Learn More?

Download the full paper or explore the Epic ReduAct dataset to dive deeper into our research.


Epic ReduAct Dataset


The Epic ReduAct dataset is derived from Epic-Kitchens-100, a large-scale ego-centric video dataset for action recognition, and is specifically designed to investigate the differences between human and AI performance in ego-centric action recognition. It consists of 36 videos, 18 classified as Easy and 18 as Hard, representing different levels of activity recognition difficulty. The spatial information of each video is systematically reduced across eight hierarchical levels, allowing a controlled evaluation of recognition difficulty for both humans and AI models. The dataset incorporates the concepts of Minimal Recognisable Configuration (MIRC) and sub-MIRC, derived by progressively reducing the spatial content of the action videos. This enables a detailed analysis of how humans and AI models recognise actions under varying levels of visual information.
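To give a feel for how the 36 clips, the Easy/Hard split, and the eight reduction levels might be traversed programmatically, here is a small sketch over a hypothetical root/<Easy|Hard>/<clip_id>/level_<k>.mp4 layout; the directory names and file pattern are assumptions for illustration, not the released format.

from pathlib import Path

def iter_reduact(root: str):
    """Yield (difficulty, clip_id, level, path) for every reduced clip,
    assuming a root/<Easy|Hard>/<clip_id>/level_<k>.mp4 layout."""
    for difficulty in ("Easy", "Hard"):
        for clip_dir in sorted(Path(root, difficulty).iterdir()):
            for level in range(8):  # eight hierarchical reduction levels
                path = clip_dir / f"level_{level}.mp4"
                if path.exists():
                    yield difficulty, clip_dir.name, level, path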

Frequently Asked Questions

What is the Epic ReduAct dataset?

The Epic ReduAct dataset is derived from the Epic-Kitchens-100 dataset and is designed to compare human and AI performance in ego-centric action recognition.

How does this research benefit AI development?

Our research highlights the differences in recognition mechanisms between humans and AI, providing insights for improving AI models in challenging real-world scenarios.

Where can I access the dataset and code?

You can access the dataset and code on our GitHub repository.

Key Findings from the Epic ReduAct Dataset

This figure presents the recognition-gap frequency distributions for the Easy, Hard, and combined sets (a, b, c), allowing a comparison between humans and the AI model. Our results show a distribution pattern similar to previous work on images (d): the AI model sometimes improves as content is reduced, whereas human accuracy consistently declines, and humans show a sharper decrease in recognition performance than the AI model (d). Our results further show that humans are susceptible to substantial losses in recognition confidence, whereas spatial reductions can enhance the AI model's ability to detect actions, as evidenced by negative recognition gaps. The frequency distributions are also broader for humans than for the AI model, whose recognition gaps are smaller and more gradual. These findings indicate that, despite advancements in AI models, the gap between human and machine recognition capabilities persists.
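In the MIRC literature, the recognition gap for a video is typically the recognition score at its MIRC minus the score at its sub-MIRC, so positive gaps mean recognition collapses after the extra reduction and negative gaps mean it improves. The sketch below computes per-video gaps and a simple frequency distribution of the kind plotted above; the bin edges are an illustrative assumption.

import numpy as np

def recognition_gap(mirc_score: float, sub_mirc_score: float) -> float:
    """Positive gaps: recognition drops after the extra reduction.
    Negative gaps: the extra reduction helped, as observed for the AI
    model on some clips."""
    return mirc_score - sub_mirc_score

def gap_histogram(gaps, bins=np.linspace(-1.0, 1.0, 21)):
    """Frequency distribution of recognition gaps, one value per video."""
    counts, edges = np.histogram(gaps, bins=bins)
    return counts, edges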

BibTeX

@inproceedings{Rahmani:HumanvsMachine:CVPRWS:2025,
        AUTHOR = "Rahmani, Sadegh and Rybansky, Filip and Vuong, Quoc and Guerin, Frank and Gilbert, Andrew",
        TITLE = "Human vs. Machine Minds: Ego-Centric Action Recognition Compared",
        BOOKTITLE = "IEEE/CVF Conference on Computer Vision and Pattern Recognition - Workshop on Multimodal Algorithmic Reasoning (MAR'25)",
        YEAR = "2025",
        }