This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025)
☆27Sep 6, 2025Updated 8 months ago
Alternatives and similar repositories for PAVE
Users that are interested in PAVE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.☆12Oct 15, 2021Updated 4 years ago
- ☆17Dec 23, 2022Updated 3 years ago
- [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"☆59Oct 9, 2025Updated 7 months ago
- Dataset of measurements from a low-cost single-photon camera used in our CVPR 2024 paper "Towards 3D Vision with Low-Cost Single-Photon C…☆14Nov 24, 2025Updated 6 months ago
- Official Implementation for "ESCAPE: Encoding Super-keypoints for Category-Agnostic Pose Estimation", CVPR 2024.☆10Jun 17, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Video feature extraction pipeline that supports diverse models including I3D, SlowFast, EgoVLP, and CLIP.☆13Apr 20, 2024Updated 2 years ago
- Official PyTorch implementation of the ICML 2023 paper "Adaptive IMLE for Few-shot Pretraining-free Generative Modelling "☆16Feb 13, 2025Updated last year
- Code release for DeepEDM (ICML 2025)☆29Jan 20, 2026Updated 4 months ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 5 months ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- ☆20Oct 5, 2023Updated 2 years ago
- ☆33Feb 17, 2026Updated 3 months ago
- Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)☆13Nov 9, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Nov 16, 2020Updated 5 years ago
- fork from https://github.com/jwyang/faster-rcnn.pytorch☆10Aug 6, 2018Updated 7 years ago
- [ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model☆17Feb 13, 2025Updated last year
- Official Implementation of Video-MA2MBA☆12Dec 3, 2024Updated last year
- ☆17Nov 8, 2023Updated 2 years ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆56Jul 1, 2025Updated 10 months ago
- Official implementation of POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples (NeurIPS 2021)☆14Aug 6, 2022Updated 3 years ago
- [ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges☆84Feb 27, 2025Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…☆26Jun 4, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- LaTeX template for the undergraduate thesis of Central South University☆14Dec 4, 2019Updated 6 years ago
- Official code repository of Shuffle-R1☆26Feb 23, 2026Updated 3 months ago
- A Fast PyTorch implementation for ICCV 19 paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation"☆10Jul 29, 2019Updated 6 years ago
- [ICML 2024 Oral] LSH-Based Efficient Point Transformer (HEPT)☆26Jan 24, 2025Updated last year
- Official Implementation of SnAG (CVPR 2024)☆59Apr 26, 2025Updated last year
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"☆100Apr 4, 2023Updated 3 years ago
- ☆24May 19, 2023Updated 3 years ago
- Tensorflow implementation for paper: Learning to Compare: Relation Network for Few-Shot Learning.☆17Apr 4, 2020Updated 6 years ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆34Feb 22, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆29Apr 8, 2025Updated last year
- Inductive and Transductive Few-Shot Video Classification via Appearance and Temporal Alignments (ECCV 2022)☆26Nov 12, 2024Updated last year
- Streaming Video Instruction Tuning☆75Feb 25, 2026Updated 3 months ago
- Deep learning tutorial☆29Jan 29, 2018Updated 8 years ago
- ☆31Jan 18, 2026Updated 4 months ago
- This is sample source code for Reinforcement Learning Competition, hosted by FPT-Software (Hanoi, Vietnam). The game is Gold Miner.☆27Sep 25, 2020Updated 5 years ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year