ttgeng233 / LongVALE

LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))
18Updated this week

Alternatives and similar repositories for LongVALE:

Users that are interested in LongVALE are comparing it to the libraries listed below