[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
☆49 · Feb 28, 2026 · Updated 3 months ago
Alternatives and similar repositories for LongVideo-R1
Users that are interested in LongVideo-R1 are comparing it to the libraries listed below.
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos ☆27 · Apr 8, 2025 · Updated last year
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels ☆18 · Jul 2, 2024 · Updated last year
- ☆12 · Sep 19, 2021 · Updated 4 years ago
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding ☆221 · Dec 19, 2025 · Updated 5 months ago
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues ☆61 · May 2, 2025 · Updated last year
- ☆19 · Jun 2, 2026 · Updated 2 weeks ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos" ☆29 · Aug 28, 2023 · Updated 2 years ago
- ☆17 · Dec 12, 2019 · Updated 6 years ago
- zoloy's backend learning journey, including development conventions, study methods, reflections, and learning resources; suitable for anyone who wants to learn Java backend development. ☆20 · Sep 11, 2024 · Updated last year
- Learning from Noisy Anchors for One-stage Object Detection ☆27 · Apr 14, 2021 · Updated 5 years ago
- This repository is "SSL for Image Representation", one of the OpenLabs of PseudoLab. ☆14 · Sep 11, 2023 · Updated 2 years ago
- ☆73 · Apr 21, 2026 · Updated last month
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos" ☆21 · Dec 14, 2025 · Updated 6 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆35 · Jun 7, 2026 · Updated last week
- Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning … ☆90 · Aug 25, 2025 · Updated 9 months ago
- A set of utils for ComfyUI LoRA operations ☆32 · Apr 14, 2026 · Updated 2 months ago
- [NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models ☆23 · Oct 21, 2025 · Updated 7 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379) ☆46 · Jul 1, 2025 · Updated 11 months ago
- OmniSVG: A Unified Scalable Vector Graphics Generation Model; you can try it in ComfyUI ☆29 · Dec 5, 2025 · Updated 6 months ago
- ☆11 · Sep 19, 2025 · Updated 8 months ago
- ☆31 · Oct 8, 2025 · Updated 8 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation. ☆82 · May 2, 2026 · Updated last month
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding ☆66 · Mar 16, 2026 · Updated 3 months ago
- ☆29 · Sep 5, 2024 · Updated last year
- 'Discretization-Aware Architecture Search' alleviates the discretization gap in one-shot differentiable NAS. DAAS has been accepted by PR… ☆20 · Jul 30, 2021 · Updated 4 years ago
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL." ☆66 · Feb 25, 2026 · Updated 3 months ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback" ☆12 · Dec 6, 2024 · Updated last year
- The PyTorch implementation for "GraFormer: Graph Convolution Transformer for 3D Pose Estimation" https://arxiv.org/pdf/2109.08364.pdf ☆56 · Nov 24, 2021 · Updated 4 years ago
- [NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning ☆106 · Sep 19, 2025 · Updated 8 months ago
- Mixture of LoRA Experts ☆11 · Apr 7, 2024 · Updated 2 years ago
- ☆14 · Jul 17, 2025 · Updated 11 months ago
- ☆18 · Oct 16, 2019 · Updated 6 years ago
- The implementation of GOLD_NAS ☆24 · Feb 17, 2025 · Updated last year
- ☆29 · May 26, 2025 · Updated last year
- ☆22 · Jun 6, 2024 · Updated 2 years ago
- Mini library for collecting images from Google Street View. Generally designed for collecting datasets for ML ☆11 · Nov 15, 2021 · Updated 4 years ago
- Split videos and extract shots ☆92 · Dec 4, 2025 · Updated 6 months ago
- [ICCV2023] Joint-Relation Transformer for Multi-Person Motion Prediction ☆28 · Sep 20, 2023 · Updated 2 years ago
- Code for the ICCV'21 paper "Context-aware Scene Graph Generation with Seq2Seq Transformers" ☆43 · Jan 6, 2022 · Updated 4 years ago