acherstyx / Compressed-Video-Reader

A video reader for extracting motion vectors and residuals from encoded H.264 videos.

☆21

Alternatives and similar repositories for Compressed-Video-Reader:

Users that are interested in Compressed-Video-Reader are comparing it to the libraries listed below

zhaoyue-zephyrus / AVION
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
☆126Updated 8 months ago
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆96Updated 8 months ago
TengdaHan / TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆116Updated last year
ByZ0e / Glance-Focus
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆26Updated 9 months ago
j-min / HiREST
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
☆100Updated 2 months ago
ninatu / howtocaption
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆51Updated 5 months ago
HengLan / CGSTVG
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆51Updated 9 months ago
antoyang / TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆177Updated last year
jbistanbul / MiniROAD
☆33Updated 10 months ago
alibaba-mmai-research / DiST
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Updated last year
yeliudev / R2-Tuning
🌀 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
☆80Updated 8 months ago
houzhijian / GroundNLQ
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
☆16Updated last year
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆99Updated last year
r-cui / ViGA
"Video Moment Retrieval from Text Queries via Single Frame Annotation" in SIGIR 2022
☆69Updated 2 years ago
GenjiB / ECLIPSE
☆31Updated 2 years ago
mlvlab / vid-TLDR
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
☆46Updated 10 months ago
facebookresearch / htstep
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆17Updated last year
acherstyx / CoCap
[ICCV 2023] Accurate and Fast Compressed Video Captioning
☆39Updated last year
jpthu17 / HBI
[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
☆113Updated 3 months ago
jinhyunj / EaTR
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆50Updated last year
ttgeng233 / UnAV
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
☆63Updated last year
TengdaHan / AutoAD
[CVPR'23 Highlight] AutoAD: Movie Description in Context.
☆94Updated 4 months ago
ttgeng233 / LongVALE
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))
☆18Updated this week
zjr2000 / LLMVA-GEBC
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
☆30Updated last year
HJYao00 / Side4Video
☆38Updated 11 months ago
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆40Updated 11 months ago
md-mohaiminul / ViS4mer
☆54Updated 2 years ago
sauradip / DiffusionTAD
[ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"
☆36Updated 2 years ago
waybarrios / guidance-based-video-grounding
[ICCV 2023] The official PyTorch implementation of the paper: "Localizing Moments in Long Video Via Multimodal Guidance"
☆19Updated 6 months ago
fmu2 / snag_release
Official Implementation of SnAG (CVPR 2024)
☆44Updated 5 months ago