farewellthree/BT-Adapter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/farewellthree/BT-Adapter)

farewellthree / BT-Adapter

[CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning"

☆35

Alternatives and similar repositories for BT-Adapter

Users that are interested in BT-Adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DCDmllm / Momentor
View on GitHub
☆81Nov 24, 2024Updated last year
farewellthree / PPLLaVA
View on GitHub
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆133Nov 19, 2024Updated last year
patrick-0817 / T-MASS-dataleakage
View on GitHub
☆10Nov 27, 2024Updated last year
princetonvisualai / merv
View on GitHub
Unifying Specialized Visual Encoders for Video Language Models
☆25Nov 22, 2025Updated 8 months ago
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UCSB-AI / MMWorld
View on GitHub
Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
☆28Jul 15, 2025Updated last year
farewellthree / STAN
View on GitHub
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107Jan 28, 2024Updated 2 years ago
ziplab / SPT
View on GitHub
[ICCV 2023 oral] This is the official repository for our paper: ''Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning''.
☆76Sep 24, 2023Updated 2 years ago
PeterWang512 / AttributeByUnlearning
View on GitHub
Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."
☆17May 23, 2025Updated last year
farewellthree / Causal-Context-Debiasing
View on GitHub
CCD： Official PyTorch implementation of the paper "Contextual Debiasing for Visual Recognition with Causal Mechanisms"
☆17Jan 26, 2023Updated 3 years ago
mlvlab / DeepVideoR1
View on GitHub
[NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"
☆36Feb 22, 2026Updated 4 months ago
aiming-lab / ReAgent-V
View on GitHub
[NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding
☆51Sep 21, 2025Updated 10 months ago
Allen123321 / Self-Supervised_Learning_Papers-Code
View on GitHub
Self-Supervised learning papers
☆25Jan 19, 2022Updated 4 years ago
patrick-0817 / T-MASS-text-video-retrieval
View on GitHub
Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for …
☆23May 1, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
GenjiB / LAVISH
View on GitHub
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆107Aug 11, 2023Updated 2 years ago
EgoAlpha / Egocentric-Dataset
View on GitHub
☆39Mar 24, 2022Updated 4 years ago
CNVid / CNVid-3.5M
View on GitHub
This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…
☆26Nov 28, 2023Updated 2 years ago
zai-org / LVBench
View on GitHub
[ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark
☆145Jul 9, 2025Updated last year
PKU-YuanGroup / Video-Bench
View on GitHub
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
☆140Dec 31, 2023Updated 2 years ago
OpenGVLab / unmasked_teacher
View on GitHub
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348May 27, 2024Updated 2 years ago
Zhuzi24 / Video-Dynamic-Scene-Graph-Generation
View on GitHub
☆16May 9, 2024Updated 2 years ago
llyx97 / TempCompass
View on GitHub
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆133Apr 4, 2025Updated last year
TencentARC / ST-LLM
View on GitHub
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
☆153Sep 10, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RenShuhuai-Andy / TESTA
View on GitHub
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
☆50Jan 9, 2024Updated 2 years ago
ali-vilab / CAPability
View on GitHub
What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness
☆28May 16, 2025Updated last year
RongKaiWeskerMA / INSTA
View on GitHub
The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
☆13Apr 14, 2024Updated 2 years ago
xjtupanda / Sparrow
View on GitHub
Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"
☆48Sep 3, 2025Updated 10 months ago
mapupcal / IntelligentScissor
View on GitHub
IntelligetnScissor implemented by C++.
☆12Apr 20, 2018Updated 8 years ago
mbzuai-oryx / CVRR-Evaluation-Suite
View on GitHub
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50Aug 23, 2024Updated last year
CASIA-IVA-Lab / VideoNIAH
View on GitHub
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
☆57Mar 9, 2025Updated last year
wenhuchen / Semi-Supervised-Image-Captioning
View on GitHub
Code for "bootstrap, review, decode: using out-of-domain textual data to improve image captioning"
☆21Dec 26, 2016Updated 9 years ago
para-lost / RVP
View on GitHub
Recursive Visual Programming (ECCV 2024)
☆18Nov 20, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
TerminologyHub / termhub-in-5-minutes
View on GitHub
Developer project for getting basic API integrations working in under 5 minutes
☆11May 22, 2026Updated last month
ilkerkesen / ViLMA
View on GitHub
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
☆16Jan 18, 2024Updated 2 years ago
ZhangYuanhan-AI / NOAH
View on GitHub
[TPAMI] Searching prompt modules for parameter-efficient transfer learning.
☆241Dec 8, 2023Updated 2 years ago
NVlabs / LITA
View on GitHub
☆194Oct 14, 2024Updated last year
XLiu443 / Tem-adapter
View on GitHub
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37Oct 18, 2023Updated 2 years ago
SilentView / LVD-2M
View on GitHub
[NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"
☆79Oct 15, 2024Updated last year
QiWang98 / VideoRFT
View on GitHub
[NeurIPS 2025] VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
☆65Jan 6, 2026Updated 6 months ago