[arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
☆71Dec 17, 2025Updated 5 months ago
Alternatives and similar repositories for SAGE
Users who are interested in SAGE are comparing it to the libraries listed below.
- Open-sourced evaluation suite from the Monitoring Monitorability paper☆79Apr 22, 2026Updated last month
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 7 months ago
- ☆29Apr 8, 2025Updated last year
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"☆26Aug 24, 2023Updated 2 years ago
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
- [ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning☆35Jan 14, 2026Updated 5 months ago
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆12Sep 22, 2023Updated 2 years ago
- Code for "Nearest Neighbor Classifier Embedded Network for Active Learning", AAAI 2021☆10Feb 3, 2021Updated 5 years ago
- Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.☆40Jun 7, 2026Updated last week
- This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…☆68Apr 8, 2026Updated 2 months ago
- A lightweight Text-to-Image Retrieval model [Web App]☆29Dec 6, 2024Updated last year
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- Code release for "SegLLM: Multi-round Reasoning Segmentation"☆128Feb 20, 2025Updated last year
- [𝗜𝗖𝗔𝗦𝗦𝗣 𝟮𝟬𝟮𝟱 𝗢𝗿𝗮𝗹] ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sa…☆15May 2, 2026Updated last month
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools☆30Nov 3, 2025Updated 7 months ago
- Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024☆21Apr 25, 2025Updated last year
- [ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models☆19Sep 25, 2025Updated 8 months ago
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding☆128Nov 12, 2025Updated 7 months ago
- [ICLR 2026][Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization☆72May 26, 2026Updated 2 weeks ago
- COCO API Customized for OVIS evaluation☆17Nov 8, 2021Updated 4 years ago
- AI Powered Dockerfile Generator Using Llama3.1 with GROQ☆11Oct 24, 2024Updated last year
- ☆52Sep 13, 2024Updated last year
- ☆22Oct 21, 2024Updated last year
- Discrete Flow Matching implemented in PyTorch☆34Mar 23, 2025Updated last year
- A web page that gathers and displays stock market indicators and uses AI to predict the market's direction.☆50May 12, 2026Updated last month
- ☆15Feb 2, 2025Updated last year
- ☆17Oct 4, 2024Updated last year
- ☆10Dec 23, 2020Updated 5 years ago
- Repo for the testing-genai workshop☆14May 8, 2025Updated last year
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆27Jun 27, 2025Updated 11 months ago
- LLM model runway server☆13Sep 13, 2023Updated 2 years ago
- moodist☆28Apr 23, 2026Updated last month
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'☆29Oct 12, 2022Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆35Mar 31, 2023Updated 3 years ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated 2 years ago
- ☆28Oct 19, 2021Updated 4 years ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆21Jun 16, 2023Updated 2 years ago