The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction

A bottleneck-based attention model called Temporal Progressive (TemPr) achieves state-of-the-art performance in early action prediction by leveraging progressive sampling across multiple scales.

Year
2022
Venue
CVPR 2023
Authors
2

Abstract & full text
arxiv.org/abs/2204.13340v2

Abstract

Early action prediction deals with inferring the ongoing action from partially-observed videos, typically at the outset of the video. We propose a bottleneck-based attention model that captures the evolution of the action, through progressive sampling over fine-to-coarse scales. Our proposed Temporal Progressive (TemPr) model is composed of multiple attention towers, one for each scale. The predicted action label is based on the collective agreement considering confidences of these towers. Extensive experiments over four video datasets showcase state-of-the-art performance on the task of Early Action Prediction across a range of encoder architectures. We demonstrate the effectiveness and consistency of TemPr through detailed ablations.
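
The two ideas in the abstract, progressive sampling over fine-to-coarse scales and a label chosen from the towers' collective confidence, can be illustrated with a small sketch. This is not the authors' implementation: the abstract specifies neither the sampling rule nor the aggregation function. The sketch assumes that scale i spans the first i/n of the observed frames, that each scale samples the same number of frames, and that each tower's class probabilities are weighted by that tower's top probability. All function names are hypothetical.

```python
import math


def progressive_scales(num_observed, num_scales):
    """Return (start, end) frame spans, ordered fine to coarse.

    Assumption: scale i covers the first i/num_scales of the observed frames.
    """
    if num_observed < 1 or num_scales < 1:
        raise ValueError("num_observed and num_scales must be positive")
    return [(0, math.ceil(num_observed * i / num_scales))
            for i in range(1, num_scales + 1)]


def sample_indices(span, frames_per_scale):
    """Pick evenly spaced frame indices inside a span.

    Indices repeat when the span is shorter than frames_per_scale.
    """
    start, end = span
    length = end - start
    return [start + (k * length) // frames_per_scale
            for k in range(frames_per_scale)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate(tower_logits):
    """Combine per-tower predictions into one label.

    Assumption: each tower is weighted by its confidence, taken here as its
    highest class probability. The paper's aggregation may differ.
    """
    probs = [softmax(logits) for logits in tower_logits]
    confidences = [max(p) for p in probs]
    norm = sum(confidences)
    num_classes = len(probs[0])
    combined = [
        sum(c / norm * p[j] for c, p in zip(confidences, probs))
        for j in range(num_classes)
    ]
    label = max(range(num_classes), key=lambda j: combined[j])
    return label, combined


if __name__ == "__main__":
    spans = progressive_scales(num_observed=16, num_scales=4)
    for span in spans:
        print(span, sample_indices(span, frames_per_scale=4))
    # Hypothetical logits from three towers over three classes.
    print(aggregate([[2.0, 0.0, 0.0], [0.0, 0.1, 0.0], [1.5, 0.0, 0.2]]))
```

With 16 observed frames and 4 scales, the spans are (0, 4), (0, 8), (0, 12), and (0, 16), so the finest tower sees consecutive early frames while the coarsest sees the whole observed portion at a lower rate. Weighting by confidence lets a tower that is sure of its prediction count for more than an uncertain one, which is one simple reading of "collective agreement".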
