[NeurIPS'25 Spotlight] ARM: Adaptive Reasoning Model
☆65Mar 10, 2026Updated last week
Alternatives and similar repositories for arm
Users who are interested in arm are comparing it to the repositories listed below
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆27Aug 9, 2025Updated 7 months ago
- [CVPR'2025] Synthetic Data is an Elegant GIFT for Continual Vision-Language Models☆24Jun 29, 2025Updated 8 months ago
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…☆47Feb 15, 2026Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆103Aug 30, 2025Updated 6 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- A detailed implementation of handling long-term memory in Agentic AI☆41Oct 9, 2025Updated 5 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 8 months ago
- Tracking the latest and greatest research papers on diffusion large language models.☆23Mar 13, 2026Updated last week
- [ECCV 2024] Code for the paper "Mew: Multiplexed Immunofluorescence Image Analysis through an Efficient Multiplex Network"☆17Jul 27, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆48Oct 16, 2025Updated 5 months ago
- ☆111Dec 10, 2025Updated 3 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆73Dec 8, 2025Updated 3 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 9 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 8 months ago
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- ☆14Jan 6, 2025Updated last year
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆22Aug 14, 2025Updated 7 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆40Oct 30, 2025Updated 4 months ago
- Implementation code for ACL 2024: Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- ☆18Apr 10, 2025Updated 11 months ago
- The training code for Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- (ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillat…☆34Aug 23, 2025Updated 6 months ago
- ☆47Apr 9, 2025Updated 11 months ago
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆11Mar 14, 2023Updated 3 years ago
- Documentation at☆14Mar 27, 2025Updated 11 months ago
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆65Sep 27, 2025Updated 5 months ago
- ☆12Jun 12, 2024Updated last year
- Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…☆91Jan 29, 2026Updated last month
- 💻 SETA: Scaling Environments for Terminal Agents - Environments☆119Feb 16, 2026Updated last month
- ☆16Jul 29, 2025Updated 7 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆201Dec 18, 2025Updated 3 months ago
- ☆16Jun 10, 2025Updated 9 months ago
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- [ITSC'25] LLM-Guided Evaluation and Adversarial Generation of Safety-Critical Driving Scenarios☆23Aug 29, 2025Updated 6 months ago
- ☆20Mar 3, 2025Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- ☆182Dec 5, 2025Updated 3 months ago