[arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
☆64Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for SAGE
Users that are interested in SAGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"☆79Feb 1, 2026Updated last month
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 4 months ago
- ☆28Apr 8, 2025Updated 11 months ago
- Empowering Data Driven insights through hands-on projects, SQL challenges and practical tools.☆24Mar 7, 2026Updated 2 weeks ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 5 months ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- State-Relabeling Adversarial Active Learning☆14Aug 17, 2021Updated 4 years ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 10 months ago
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆13Sep 22, 2023Updated 2 years ago
- Google 拼音输入法☆12Sep 16, 2019Updated 6 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆54Apr 9, 2024Updated last year
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆59Mar 14, 2026Updated last week
- Dr. Wang' repository☆12Nov 30, 2019Updated 6 years ago
- ☆72Updated this week
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [ICLR 2026][Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization☆56Updated this week
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆128Feb 20, 2025Updated last year
- [CVPR 2026] PAI-Bench: A Comprehensive Benchmark for Physical AI☆57Feb 21, 2026Updated last month
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools☆30Nov 3, 2025Updated 4 months ago
- ☆14Jun 12, 2024Updated last year
- Official code for "Vision Transformers with Self-Distilled Registers" (NeurIPS 2025 Spotlight)☆32Dec 6, 2025Updated 3 months ago
- 使用mnn-llm对GOT-OCR2.0进行推理☆13Oct 2, 2024Updated last year
- iMessage RAG MCP Server from Anthropic MCP Hackathon (NYC)☆14Mar 10, 2025Updated last year
- Repo for the testing-genai workshop☆13May 8, 2025Updated 10 months ago
- Official PyTorch implementation of QwT—“Quantization without Tears” (CVPR 2025): fast, accurate, and hassle-free post-training network qu…☆32Sep 30, 2025Updated 5 months ago
- ☆17Jul 23, 2024Updated last year
- AI Powered Dockerfile Generator Using Llama3.1 with GROQ☆11Oct 24, 2024Updated last year
- ☆16Oct 4, 2024Updated last year
- ☆22Oct 21, 2024Updated last year
- Discrete Flow Matching implemented in PyTorch☆35Mar 23, 2025Updated last year
- ☆15Feb 2, 2025Updated last year
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆25Jun 27, 2025Updated 8 months ago
- moodist☆25Mar 13, 2026Updated last week
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'☆29Oct 12, 2022Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆36Mar 31, 2023Updated 2 years ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆124Apr 26, 2024Updated last year
- [CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding☆42Feb 5, 2026Updated last month