fmu2/snag_release

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fmu2/snag_release)

fmu2 / snag_release

Official Implementation of SnAG (CVPR 2024)

☆59

Alternatives and similar repositories for snag_release

Users that are interested in snag_release are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

happyharrycn / actionformer_release
View on GitHub
Code release for ActionFormer (ECCV 2022)
☆571Apr 11, 2024Updated 2 years ago
solicucu / D3G
View on GitHub
☆15Oct 30, 2023Updated 2 years ago
yingsen1 / UniMD
View on GitHub
UniMD: Towards Unifying Moment retrieval and temporal action Detection
☆57Jul 5, 2024Updated 2 years ago
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
StarsThu2016 / ApproxDet
View on GitHub
☆12Nov 16, 2020Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
houzhijian / CONE
View on GitHub
[2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
☆31Aug 5, 2023Updated 2 years ago
sudo-Boris / mr-Blip
View on GitHub
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆95Mar 9, 2025Updated last year
fmu2 / gradfeat20
View on GitHub
Gradients as Features for Deep Representation Learning
☆43Mar 8, 2020Updated 6 years ago
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
rxtan2 / Koala-video-llm
View on GitHub
☆37Sep 16, 2024Updated last year
yeliudev / R2-Tuning
View on GitHub
🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
☆91Jul 2, 2024Updated 2 years ago
abrarmajeedi / rica2_aqa
View on GitHub
Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)
☆15Nov 9, 2025Updated 8 months ago
sunoh-kim / pps
View on GitHub
Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Super…
☆19Jan 19, 2024Updated 2 years ago
fmu2 / nlos3d
View on GitHub
☆18Dec 23, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
zhuoyan-xu / Foundation-Model_Multitask
View on GitHub
☆17Mar 14, 2024Updated 2 years ago
zjr2000 / GVL
View on GitHub
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
☆28Dec 8, 2023Updated 2 years ago
Lzq5 / UniTime
View on GitHub
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆56May 20, 2026Updated 2 months ago
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 4 months ago
ttgeng233 / UniAV
View on GitHub
Unified Audio-Visual Perception for Multi-Task Video Localization
☆33Apr 19, 2024Updated 2 years ago
Becomebright / GroundVQA
View on GitHub
Official PyTorch code of GroundVQA (CVPR'24)
☆63Sep 13, 2024Updated last year
sming256 / ETAD
View on GitHub
[CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection
☆19Oct 3, 2024Updated last year
HYUNJS / STOV-TAL
View on GitHub
[WACV-2025] Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
☆17May 28, 2025Updated last year
jinhyunj / EaTR
View on GitHub
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55Sep 7, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
facebookresearch / ProcedureVRL
View on GitHub
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆56Aug 8, 2023Updated 2 years ago
TimeMarker-LLM / TimeMarker
View on GitHub
A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability
☆107Nov 28, 2024Updated last year
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
Tanveer81 / RGNet
View on GitHub
This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos
☆20Mar 3, 2025Updated last year
baopj / E3M
View on GitHub
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11Jul 16, 2024Updated 2 years ago
lgzlIlIlI / Boosting-WTAL
View on GitHub
☆48Sep 22, 2023Updated 2 years ago
Soldelli / VLG-Net
View on GitHub
VLG-Net: Video-Language Graph Matching Networks for Video Grounding
☆31May 31, 2022Updated 4 years ago
sangminwoo / Explore-And-Match
View on GitHub
Official pytorch implementation of "Explore-And-Match: Bridging Proposal-Based and Proposal-Free With Transformer for Sentence Grounding …
☆42Aug 5, 2022Updated 3 years ago
HuiGuanLab / DL-DKD
View on GitHub
Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
☆19May 13, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gyxxyg / VTG-LLM
View on GitHub
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
☆130Dec 10, 2024Updated last year
hlchen23 / VERIFIED
View on GitHub
Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…
☆40Jan 20, 2025Updated last year
afcedf / SOONet
View on GitHub
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
☆30Jun 24, 2024Updated 2 years ago
Hon-Wong / ByteVideoLLM
View on GitHub
[ICCV 2025] Dynamic-VLM
☆28Dec 16, 2024Updated last year
linkangheng / Video-UTR
View on GitHub
[ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs
☆61Feb 27, 2025Updated last year
OpenGVLab / VRBench
View on GitHub
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆28Jun 4, 2026Updated last month
LTContext / LTContext
View on GitHub
[ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?
☆50Jun 21, 2024Updated 2 years ago