Scaling Long-Horizon LLM Agent via Context-Folding
☆148Jan 26, 2026Updated 3 months ago
Alternatives and similar repositories for FoldAgent
Users that are interested in FoldAgent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PARL (Parallel-Agent Reinforcement Learning) is a training paradigm that teaches models to decompose complex tasks into parallel subtasks…☆40Mar 24, 2026Updated last month
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- ☆78Mar 23, 2026Updated last month
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution☆103Sep 24, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Jul 11, 2023Updated 2 years ago
- [NeurIPS 2025] Mind the Gap: Bridging Thought Leap for Improved CoT Tuning https://arxiv.org/abs/2505.14684☆48Oct 20, 2025Updated 6 months ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆246Aug 27, 2025Updated 8 months ago
- ☆13Mar 5, 2025Updated last year
- Dateset Reset Policy Optimization☆31Apr 12, 2024Updated 2 years ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark☆127Updated this week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆21Sep 7, 2025Updated 7 months ago
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆22Jul 10, 2023Updated 2 years ago
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 11 months ago
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆162Jun 26, 2025Updated 10 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆188Jul 23, 2025Updated 9 months ago
- ☆20Nov 3, 2024Updated last year
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆60Jul 24, 2025Updated 9 months ago
- ☆17Feb 26, 2024Updated 2 years ago
- OmniGAIA: Towards Native Omni-Modal AI Agents☆124Apr 2, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]☆25Nov 3, 2024Updated last year
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- ☆22Jun 11, 2024Updated last year
- ☆14Dec 18, 2024Updated last year
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆100Oct 27, 2025Updated 6 months ago
- ☆171Nov 26, 2025Updated 5 months ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…☆51Jan 5, 2026Updated 3 months ago
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆74Oct 15, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆30Oct 30, 2023Updated 2 years ago
- PROSE Public Benchmark Suite☆32Sep 15, 2025Updated 7 months ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Official code implementation of the paper: QuCo-RAG: Quantifying Uncertainty from the Pre-training Corpus for Dynamic Retrieval-Augmente…☆38Apr 8, 2026Updated 3 weeks ago
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆25Apr 14, 2025Updated last year
- An open-source clone of Kerbal Space Program created in Unity.☆14Mar 12, 2018Updated 8 years ago
- Fork to run instances from SWE-rebench☆24Apr 22, 2026Updated last week