zjuruizhechen/TVG-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zjuruizhechen/TVG-R1)

zjuruizhechen / TVG-R1

[EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning

☆36

Alternatives and similar repositories for TVG-R1

Users that are interested in TVG-R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZackZikaiXiao / Awesome-Agent-Environments
View on GitHub
Awesome Agent Environments
☆17Apr 10, 2026Updated 3 months ago
FanZT6 / FairMT-bench
View on GitHub
☆14Mar 7, 2025Updated last year
zjuruizhechen / PAD
View on GitHub
[ICLR 2025] Pad: Personalized alignment of llms at decoding-time
☆20Mar 19, 2025Updated last year
zjuruizhechen / Awesome-Video-Agent
View on GitHub
A collection of awesome think with videos papers.
☆100Dec 1, 2025Updated 7 months ago
Lzq5 / UniTime
View on GitHub
Universal Video Temporal Grounding with Generative Multi-modal Large Language Models
☆56May 20, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JianhongBai / COLT
View on GitHub
Official implementation of "On the Effectiveness of Out-of-Distribution Data in Self-Supervised Long-Tail Learning" (ICLR 2023)
☆15Jul 15, 2023Updated 3 years ago
Time-Search / TimeSearch-R
View on GitHub
[ICLR 2026] Official code for paper: TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinf…
☆27Jan 29, 2026Updated 5 months ago
marinero4972 / CyberV
View on GitHub
☆20Jun 10, 2025Updated last year
nusnlp / d2vlm
View on GitHub
[ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models
☆24Apr 18, 2026Updated 3 months ago
fzp0424 / MT-Ladder
View on GitHub
[EMNLP'24] Code and data for paper "Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level"
☆23Jun 29, 2024Updated 2 years ago
lichengliu03 / unary-feedback
View on GitHub
☆44Mar 31, 2026Updated 3 months ago
GiantAILab / DeepSound-V1
View on GitHub
Official code for DeepSound-V1
☆12May 14, 2025Updated last year
xiaomi-research / time-r1
View on GitHub
[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
☆95Dec 14, 2025Updated 7 months ago
Tang-xiaoxiao / 3D-RAD
View on GitHub
[ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks
☆33Jun 22, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ZijiaLewisLu / CVPR2025-DeCafNet
View on GitHub
Official Repo for CVPR 2025 Paper -- DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
☆17Mar 16, 2026Updated 4 months ago
fzp0424 / MT-R1-Zero
View on GitHub
[EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"
☆69Apr 15, 2025Updated last year
WHB139426 / Grounded-Video-LLM
View on GitHub
[EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
☆149Aug 21, 2025Updated 11 months ago
iLearn-Lab / TPAMI26-Awesome-MLLMs-for-Video-Temporal-Grounding
View on GitHub
Latest Papers, Codes and Datasets on VTG-LLMs.
☆95Jul 12, 2026Updated last week
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
gyxxyg / TRACE
View on GitHub
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
☆156Aug 22, 2025Updated 11 months ago
JianhongBai / BaCon
View on GitHub
Official implementation of "Towards Distribution-Agnostic Generalized Category Discovery" (NIPS 2023)
☆29Oct 21, 2023Updated 2 years ago
bytedance / F-16
View on GitHub
F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…
☆40Jul 3, 2025Updated last year
pengr / learn-claude-code
View on GitHub
Learn Claude Code — 基于源码的完整技术分析文档集，15章深度解析 Agent Loop、工具系统、权限系统等核心机制
☆27Mar 31, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
JianhuiWei7 / UniVBench
View on GitHub
[CVPR 2026]The official code and datasets for "UniVBench: Towards Unified Evaluation for Video Foundation Models"
☆23May 27, 2026Updated last month
HuiGuanLab / DL-DKD
View on GitHub
Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
☆19May 13, 2026Updated 2 months ago
yeliudev / VideoMind
View on GitHub
🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026)
☆348Feb 8, 2026Updated 5 months ago
minghangz / OnVTG
View on GitHub
Online video temporal grounding
☆16Oct 20, 2025Updated 9 months ago
chuntianli666 / CrossVid
View on GitHub
[AAAI 2026] CrossVid: A Comprehensive Benchmark for Evaluating Cross-Video Reasoning in Multimodal Large Language Models
☆23Jul 9, 2026Updated 2 weeks ago
chrisx599 / Video-Browser
View on GitHub
Official code repo of Video-Browser: Towards Agentic Open-web Video Browsing
☆28Jan 19, 2026Updated 6 months ago
DCDmllm / Momentor
View on GitHub
☆81Nov 24, 2024Updated last year
thqiu0419 / IntentVCNet
View on GitHub
IntentVCNet: Bridging Spatio-Temporal Gaps for Intention-Oriented Controllable Video Captioning
☆19Aug 16, 2025Updated 11 months ago
qirui-chen / RGA3-release
View on GitHub
[ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring
☆24Aug 8, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jiangsongtao / Med-MoE
View on GitHub
[EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"
☆158Jul 7, 2025Updated last year
mlvlab / VidChain
View on GitHub
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆25Jan 26, 2025Updated last year
GiantAILab / DeepDubber-V1
View on GitHub
DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning…
☆30Sep 7, 2025Updated 10 months ago
junwenxiong / diff_sal
View on GitHub
Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
☆29May 26, 2024Updated 2 years ago
sunoh-kim / pps
View on GitHub
Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Super…
☆19Jan 19, 2024Updated 2 years ago
wangpengnorman / KB-Ref_dataset
View on GitHub
☆16Dec 28, 2020Updated 5 years ago
zjucsq / PLA
View on GitHub
[ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision
☆12Sep 17, 2023Updated 2 years ago