[NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684
☆48Oct 20, 2025Updated 8 months ago
Alternatives and similar repositories for Mind-the-Gap
Users that are interested in Mind-the-Gap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆54Nov 4, 2025Updated 7 months ago
- [ECCV 2026] ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆77Mar 9, 2026Updated 3 months ago
- ☆32Aug 11, 2025Updated 10 months ago
- [ICLR 2026] InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models☆56May 5, 2026Updated last month
- ☆28Aug 19, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This repository is the official implementation of TimeHC-RL (Distilabel (Data Generation) + TRL (SFT) + VeRL (GRPO)).☆48Jun 4, 2025Updated last year
- [AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615☆67Nov 8, 2025Updated 7 months ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆276Apr 30, 2026Updated last month
- Official code for "KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation"☆72Jun 13, 2026Updated 2 weeks ago
- A curated collection of resources, tools, and frameworks for developing GUI Agents.☆432Jun 2, 2026Updated 3 weeks ago
- Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"☆339May 20, 2026Updated last month
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆310Apr 15, 2026Updated 2 months ago
- On Policy Distillation Build on top of Verl☆87May 25, 2026Updated last month
- ☆51Jul 22, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The officalimplement of dLLM-Factory☆25Jul 12, 2025Updated 11 months ago
- An Advanced Basic Math Reasoning and Overthinking Evaluation Framework for LLMs☆12Apr 20, 2026Updated 2 months ago
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)☆20Apr 16, 2025Updated last year
- ☆68Jan 29, 2026Updated 4 months ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated 2 years ago
- ☆20May 14, 2025Updated last year
- [ICML'26] Scaling Long-Horizon LLM Agent via Context-Folding☆162May 18, 2026Updated last month
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆197Mar 17, 2025Updated last year
- [CVPR 2025] APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers☆44Apr 7, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆12Feb 27, 2025Updated last year
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).☆13Sep 22, 2025Updated 9 months ago
- This repository includes the data and scripts utilized in the study titled "Improving LLM-based Verilog Code Generation with Data Augment…☆14Mar 24, 2025Updated last year
- Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)☆21Nov 21, 2024Updated last year
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 11 months ago
- Documentation at☆14Mar 27, 2025Updated last year
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 9 months ago
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆31Jul 13, 2025Updated 11 months ago
- Happily_Do_USTB大物实验☆23May 27, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- auto star for repo lists☆10Aug 26, 2023Updated 2 years ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆30May 27, 2026Updated last month
- Easy and Efficient dLLM Fine-Tuning☆261Mar 2, 2026Updated 3 months ago
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆39Nov 9, 2025Updated 7 months ago
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆21Mar 21, 2025Updated last year
- Ziren(formerly zkMIPS): An open-source, simple, stable, and universal zkVM on MIPS32.☆117Updated this week
- Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.☆54Mar 8, 2026Updated 3 months ago