Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding
☆69Apr 29, 2026Updated 3 weeks ago
Alternatives and similar repositories for Tempo
Users that are interested in Tempo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- Text-guided 3D texture generation using training-free multi-diffusion in UV space.☆14Apr 7, 2025Updated last year
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆92Apr 30, 2026Updated 3 weeks ago
- [MICCAI 2025] Official code implementation for paper: ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tra…☆41Nov 4, 2025Updated 6 months ago
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆84Dec 12, 2025Updated 5 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGGRAPH 2025] 3D Stylization via Large Reconstruction Model☆32Oct 14, 2025Updated 7 months ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 9 months ago
- CN Dota, Best Dota.☆11Dec 14, 2020Updated 5 years ago
- ☆12Nov 3, 2020Updated 5 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆19Apr 21, 2026Updated last month
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- ☆12Nov 7, 2019Updated 6 years ago
- ☆12Feb 24, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- CSU签到、临时离开、签离助手☆12Aug 27, 2022Updated 3 years ago
- TBDATA 2023. STCF: Spatial-Temporal Contrasting for Fine-Grained Urban Flow inference.☆12Mar 12, 2024Updated 2 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- [NIPS2025] RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers☆52Sep 24, 2025Updated 8 months ago
- ☆13Nov 5, 2024Updated last year
- ☆13Jul 22, 2024Updated last year
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- AllCodeForDataStructure☆11Jul 5, 2014Updated 11 years ago
- Optimizing Monocular Depth Estimation with TensorRT: Model Conversion, Inference Acceleration, and 3D Reconstruction☆49Mar 9, 2026Updated 2 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod…☆18Jan 6, 2025Updated last year
- ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation☆15Jun 2, 2024Updated last year
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing“☆105Apr 13, 2026Updated last month
- ☆13Nov 23, 2022Updated 3 years ago
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated 2 weeks ago
- ☆21Apr 15, 2024Updated 2 years ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 3 months ago
- ☆32Apr 8, 2025Updated last year
- Depth-aided Camouflaged Object Detection☆17Oct 18, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Feb 18, 2022Updated 4 years ago
- Code release for our SIGGRAPH 2022 paper "Diffeomorphic Neural Surface Parameterization for 3D and Reflectance Acquisition"☆18Dec 5, 2022Updated 3 years ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- Towards Efficient Multimodal Large Language Models: A Survey on Token Compression☆183May 12, 2026Updated last week
- ☆12Mar 8, 2021Updated 5 years ago
- FlowFeat: Pixel-Dense Embedding of Motion Profiles (NeurIPS 2025 Spotlight)☆114May 13, 2026Updated last week
- Implementation of Prompt-to-Prompt Image Editing with Cross Attention Control☆16Apr 5, 2023Updated 3 years ago