[arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
☆64Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for SAGE
Users that are interested in SAGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Apr 8, 2025Updated last year
- Empowering Data Driven insights through hands-on projects, SQL challenges and practical tools.☆24Mar 7, 2026Updated last month
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆25Jan 14, 2026Updated 3 months ago
- An impelementation of image search engine using CLIP (Contrastive Language-Image Pre-Training☆15Aug 9, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.☆32Updated this week
- Code for "Nearest Neighbor Classifier Embedded Network for Active Learning", AAAI 2021☆10Feb 3, 2021Updated 5 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆53Apr 9, 2024Updated 2 years ago
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆59Mar 14, 2026Updated last month
- Dr. Wang' repository☆12Nov 30, 2019Updated 6 years ago
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆128Feb 20, 2025Updated last year
- [ICASSP 2025 Oral] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitud…☆14Feb 14, 2026Updated last month
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools☆30Nov 3, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for paper https://arxiv.org/abs/2501.00522☆14Apr 28, 2025Updated 11 months ago
- [ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models☆18Sep 25, 2025Updated 6 months ago
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding☆123Nov 12, 2025Updated 5 months ago
- [CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI☆64Updated this week
- COCO API Customized for OVIS evaluation☆17Nov 8, 2021Updated 4 years ago
- ☆17Jul 23, 2024Updated last year
- Official PyTorch implementation of QwT—“Quantization without Tears” (CVPR 2025): fast, accurate, and hassle-free post-training network qu…☆33Sep 30, 2025Updated 6 months ago
- ☆16Oct 4, 2024Updated last year
- ☆50Sep 13, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆22Oct 21, 2024Updated last year
- Discrete Flow Matching implemented in PyTorch☆34Mar 23, 2025Updated last year
- A Massive Multi-Discipline Lecture Understanding Benchmark☆34Nov 1, 2025Updated 5 months ago
- Separating Anything from Image in Context☆12May 29, 2024Updated last year
- A space dedicated for our universe.☆17Feb 10, 2024Updated 2 years ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆26Jun 27, 2025Updated 9 months ago
- Official code for paper "GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable R…☆55Mar 29, 2026Updated 2 weeks ago
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'☆29Oct 12, 2022Updated 3 years ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated last year
- ☆28Oct 19, 2021Updated 4 years ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆21Jun 16, 2023Updated 2 years ago
- ☆60May 13, 2025Updated 11 months ago
- ☆14Mar 26, 2025Updated last year
- A high-performance PDF summarization tool powered by Google's Gemma 3 LLM. Features parallel processing, async operations, and intelligen…☆21Apr 12, 2025Updated last year
- ☆21May 12, 2024Updated last year