Is Space-Time Attention All You Need for Video Understanding?
Paper: arXiv:2102.05095
An implementation of TimeSformer, the model introduced in "Is Space-Time Attention All You Need for Video Understanding?".
Trained on UCF-101 with divided space-time attention.

Model configuration:

| Param | Value |
|---|---|
| DIM | 768 |
| DEPTH | 12 |
| HEADS | 12 |
| FRAMES | 16 |
| IMG_SIZE | 224 |
| Classes | 101 |
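
As a rough illustration of what these settings imply, the sketch below derives the per-head width, the number of patches per frame, and the attention cost per patch. The class name `TimeSformerConfig` is hypothetical, and `patch_size=16` and the single classification token are assumptions (neither is stated in this card). The comparison counts follow the figures given in the paper: about N+F+2 per patch for divided space-time attention versus NF+1 for joint attention.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TimeSformerConfig:
    """Hypothetical container for the values in the table above."""
    dim: int = 768
    depth: int = 12
    heads: int = 12
    frames: int = 16
    img_size: int = 224
    num_classes: int = 101
    patch_size: int = 16  # ASSUMPTION: not stated in this card

    @property
    def head_dim(self) -> int:
        # The embedding is split evenly across attention heads.
        if self.dim % self.heads:
            raise ValueError("dim must be divisible by heads")
        return self.dim // self.heads

    @property
    def patches_per_frame(self) -> int:
        # N: non-overlapping square patches in one frame.
        if self.img_size % self.patch_size:
            raise ValueError("img_size must be divisible by patch_size")
        return (self.img_size // self.patch_size) ** 2

    @property
    def sequence_length(self) -> int:
        # All patch tokens plus one classification token (assumed).
        return self.frames * self.patches_per_frame + 1

    def comparisons_per_patch(self, divided: bool) -> int:
        # Query-key comparisons per patch, as counted in the paper.
        n, f = self.patches_per_frame, self.frames
        return n + f + 2 if divided else n * f + 1


cfg = TimeSformerConfig()
print(cfg.head_dim)                              # 64
print(cfg.patches_per_frame)                     # 196
print(cfg.sequence_length)                       # 3137
print(cfg.comparisons_per_patch(divided=True))   # 214
print(cfg.comparisons_per_patch(divided=False))  # 3137
```

Under these assumptions, divided attention needs roughly 15 times fewer comparisons per patch than joint attention at 16 frames.
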

Accuracy by split:

| Split | Top-1 | Top-5 |
|---|---|---|
| Val | 92.89% | 97.91% |
| Test | 91.64% | 98.08% |
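
For readers unfamiliar with the metrics, here is a minimal sketch of the usual definition of top-k accuracy: a clip counts as correct when its true label is among the k highest-scoring classes. The card does not say how its numbers were computed (for example, whether several clips per video were averaged), so the function `top_k_accuracy` and the toy scores are purely illustrative.

```python
from typing import Sequence


def top_k_accuracy(
    scores: Sequence[Sequence[float]], labels: Sequence[int], k: int
) -> float:
    """Fraction of samples whose true label is among the k best-scored classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")
    hits = 0
    for row, label in zip(scores, labels):
        # Class indices sorted from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Toy example: 4 samples, 3 classes (made-up scores, not model output).
scores = [
    [0.1, 0.7, 0.2],  # true label 1: top-1 hit
    [0.5, 0.3, 0.2],  # true label 1: top-1 miss, top-2 hit
    [0.2, 0.3, 0.5],  # true label 0: miss at both k=1 and k=2
    [0.6, 0.3, 0.1],  # true label 0: top-1 hit
]
labels = [1, 1, 0, 0]

print(top_k_accuracy(scores, labels, k=1))  # 0.5
print(top_k_accuracy(scores, labels, k=2))  # 0.75
```
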