bigai-nlco/VideoTGB

[EMNLP 2024] A Video Chat Agent with Temporal Prior

☆33

Alternatives and similar repositories for VideoTGB

Users who are interested in VideoTGB are comparing it to the repositories listed below.

LeapLabTHU / diver-ct
☆14 · Dec 19, 2024 · Updated last year
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
Henry839 / PaperMaster
☆15 · Apr 14, 2026 · Updated 3 months ago
WHB139426 / GCG
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]
☆10 · Jul 22, 2024 · Updated 2 years ago
bigai-nlco / TokenSwift
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆126 · May 19, 2025 · Updated last year
bigai-ai / ICE
【ICLR 2025 🔥】The code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overco…
☆56 · Apr 2, 2025 · Updated last year
zhiyuanhubj / Long_form_VideoQA
[EMNLP’24 Main] Encoding and Controlling Global Semantics for Long-form Video Question Answering
☆18 · Oct 9, 2024 · Updated last year
doc-doc / CoVGT
Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)
☆20 · Mar 9, 2024 · Updated 2 years ago
patrick-tssn / VSTAR
[ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information
☆16 · Oct 27, 2024 · Updated last year
Shenzhi-Wang / recon
The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)
☆15 · Aug 12, 2024 · Updated last year
ByZ0e / Glance-Focus
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆31 · Jun 28, 2024 · Updated 2 years ago
Andrewzh112 / ExpeL
☆14 · Dec 16, 2023 · Updated 2 years ago
bigai-nlco / RuleReasoner
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39 · Feb 25, 2026 · Updated 5 months ago
LR32768 / DL_theory_exp
☆16 · Apr 12, 2024 · Updated 2 years ago
OmniMMI / M4
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆19 · Apr 2, 2025 · Updated last year
JZXXX / Semi-SDP
☆13 · Feb 22, 2021 · Updated 5 years ago
zzhhfut / CCNet-AAAI2025
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24 · Aug 18, 2025 · Updated 11 months ago
daswer123 / Voyager_checkpoint
Checkpoint for Voyager, 160 iterations.
☆23 · May 27, 2023 · Updated 3 years ago
bigai-nlco / LooGLE
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
☆199 · Oct 8, 2024 · Updated last year
yueyang130 / SEEM
Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
☆24 · Oct 30, 2023 · Updated 2 years ago
jolin830 / SlowFast-Meet-ViT
We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …
☆14 · Nov 11, 2024 · Updated last year
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆110 · Jul 2, 2024 · Updated 2 years ago
mlvlab / Flipped-VQA
Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
☆77 · Mar 26, 2025 · Updated last year
bigai-nlco / UltraVoice
Official Repository of UltraVoice
☆63 · Oct 28, 2025 · Updated 9 months ago
SHI-Labs / IMG-Multimodal-Diffusion-Alignment
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
☆30 · Oct 1, 2025 · Updated 9 months ago
LeapLabTHU / L2W-DEN
[ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks
☆38 · Sep 28, 2023 · Updated 2 years ago
wxh1996 / VideoAgent
☆150 · Apr 16, 2025 · Updated last year
MaxPolak97 / H3D-Net-reproduction
☆11 · May 2, 2022 · Updated 4 years ago
lianshiwei / datavisualization.github.io
Visualization of China's historical GDP and population data
☆13 · Jan 18, 2023 · Updated 3 years ago
assafbk / mocha_code
Mitigating Open-Vocabulary Caption Hallucinations (EMNLP 2024)
☆19 · Oct 18, 2024 · Updated last year
OpenRL-Lab / PyTorch_Tutorial
PyTorch tips and tutorials
☆12 · Apr 17, 2023 · Updated 3 years ago
LeapLabTHU / Dynamic_Perceiver
Official implementation of Dynamic Perceiver
☆44 · Nov 16, 2023 · Updated 2 years ago
tho-kn / Ego3DPose
Official repository of the "Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views" (SIGGRAPH Asia 2023)
☆10 · Dec 24, 2024 · Updated last year
zhaoyi11 / adaptive_bc
☆15 · Jul 4, 2022 · Updated 4 years ago
OmniMMI / OpenOmniNexus
A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.
☆38 · Apr 7, 2025 · Updated last year
codesavory / IMAGEimate
IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks
☆13 · Dec 9, 2021 · Updated 4 years ago
jianwang-mpi / GlobalEgoMocap
The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.
☆12 · Sep 23, 2023 · Updated 2 years ago
AntXinyuan / sph2pob
(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods
☆14 · Aug 23, 2023 · Updated 2 years ago
LeapLabTHU / Uni-AdaFocus
Official repository of Uni-AdaFocus (TPAMI 2024).
☆59 · Dec 17, 2024 · Updated last year