
TimeSformer — UCF-101 Action Recognition

An implementation of the paper "Is Space-Time Attention All You Need for Video Understanding?" (arXiv:2102.05095).
The model is trained on UCF-101 and uses divided space-time attention.

Model Config

Param	Value
DIM	768
DEPTH	12
HEADS	12
FRAMES	16
IMG_SIZE	224
Classes	101
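
As a rough sanity check on the table above, the configuration can be written down as a small Python sketch that derives a few quantities from it. The class and field names are hypothetical, and `patch_size = 16` is an assumption borrowed from the paper's default setup; this card does not state the patch size. The comparison counts ignore the classification token.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimeSformerConfig:
    # Values copied from the Model Config table.
    dim: int = 768
    depth: int = 12
    heads: int = 12
    frames: int = 16
    img_size: int = 224
    num_classes: int = 101
    # Assumption: 16x16 patches (the paper's default); not stated in this card.
    patch_size: int = 16

    @property
    def head_dim(self) -> int:
        # The embedding dimension is split evenly across attention heads.
        if self.dim % self.heads != 0:
            raise ValueError("dim must be divisible by heads")
        return self.dim // self.heads

    @property
    def patches_per_frame(self) -> int:
        # Number of non-overlapping patches in one square frame.
        side = self.img_size // self.patch_size
        return side * side

    @property
    def num_patch_tokens(self) -> int:
        # Patch tokens across the whole clip, excluding the classification token.
        return self.patches_per_frame * self.frames

    def keys_per_query(self) -> dict:
        # How many patch tokens each query attends to in one block.
        n, f = self.patches_per_frame, self.frames
        return {
            "joint": n * f,    # every patch in every frame at once
            "divided": n + f,  # temporal pass (f frames) then spatial pass (n patches)
        }


cfg = TimeSformerConfig()
print(cfg.head_dim, cfg.patches_per_frame, cfg.num_patch_tokens)
print(cfg.keys_per_query())
```

Under the assumed patch size, divided attention compares each query against far fewer keys per block than joint space-time attention would, which is the motivation the paper gives for the divided scheme.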

Results

Split	Top-1	Top-5
Val	92.89%	97.91%
Test	91.64%	98.08%
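
For readers unfamiliar with the metrics in the table above, the following is a minimal sketch of how Top-1 and Top-5 accuracy are commonly computed from per-class scores. The function name `top_k_accuracy` and the toy scores are hypothetical and only illustrate the definition; this is not the evaluation code used to produce the reported numbers.

```python
from typing import Sequence


def top_k_accuracy(scores: Sequence[Sequence[float]], labels: Sequence[int], k: int) -> float:
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted by descending score, truncated to the top k.
        top_k = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:k]
        if label in top_k:
            hits += 1
    return hits / len(labels)


# Toy example with 3 samples and 3 classes (illustrative values only).
toy_scores = [
    [0.1, 0.7, 0.2],
    [0.5, 0.3, 0.2],
    [0.2, 0.3, 0.5],
]
toy_labels = [1, 1, 0]

print(top_k_accuracy(toy_scores, toy_labels, k=1))
print(top_k_accuracy(toy_scores, toy_labels, k=2))
```

Top-5 accuracy is always at least as high as Top-1 on the same split, since a correct Top-1 prediction is also within the top five.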


Paper for may-ur08/timesformer-ucf101

Is Space-Time Attention All You Need for Video Understanding?
arXiv:2102.05095, published Feb 9, 2021