[ICML 2025] Official Implementation of GLIDER
☆74Oct 9, 2025Updated 8 months ago
Alternatives and similar repositories for GLIDER
Users that are interested in GLIDER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2024] Official Implementation of ACORM☆66Mar 26, 2024Updated 2 years ago
- [NeurIPS 2024] Official Implementation of Meta-DT☆57Oct 16, 2024Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆75Jul 13, 2025Updated 11 months ago
- ☆20Oct 12, 2025Updated 8 months ago
- instruction-following benchmark for large reasoning models☆48Apr 19, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping☆66May 22, 2025Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- [CVPR 2026] TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models☆67Feb 21, 2026Updated 4 months ago
- Code repository for the ICML 2026 paper "Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation".☆24Jun 14, 2026Updated 2 weeks ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆80Jul 10, 2025Updated 11 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86May 21, 2025Updated last year
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆92Feb 15, 2025Updated last year
- ☆138Feb 4, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The repository of EMNLP 2023 "A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check"☆21Nov 17, 2023Updated 2 years ago
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Mar 4, 2024Updated 2 years ago
- [NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervision☆17Jun 1, 2025Updated last year
- Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.☆56Nov 10, 2025Updated 7 months ago
- 🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training☆93Dec 3, 2024Updated last year
- Test-time preferenece optimization (ICML 2025).☆185May 8, 2025Updated last year
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations [ICML…☆185Mar 29, 2026Updated 3 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆193Mar 6, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [EMNLP 2025] CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward☆69Aug 10, 2025Updated 10 months ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- code for paper "Large Language Models as End-to-end Combinatorial Optimization Solvers"☆83Oct 24, 2025Updated 8 months ago
- ☆17Dec 18, 2020Updated 5 years ago
- An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks☆56Jun 18, 2026Updated last week
- Open-source codebase for MAZero, from "Efficient Multi-agent Reinforcement Learning by Planning" at ICLR 2024.☆45May 8, 2024Updated 2 years ago
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- ☆71Jul 8, 2025Updated 11 months ago
- [ICLR 2026]🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, mul…☆213Dec 10, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆59Sep 2, 2024Updated last year
- [ICRA 2023] Sim2Real^2: Actively Building Explicit Physics Model for Precise Articulated Object Manipulation☆24Aug 21, 2023Updated 2 years ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆453Mar 20, 2026Updated 3 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆50Jul 22, 2025Updated 11 months ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆45Mar 8, 2026Updated 3 months ago
- [COLM 2025] An Open Math Pre-trainng Dataset with 370B Tokens.☆111Apr 4, 2025Updated last year
- Your efficient and accurate answer verification system for RL training.☆42Jun 23, 2025Updated last year