qixucen / atom
[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling
☆632 Updated 2 months ago
Alternatives and similar repositories for atom
Users that are interested in atom are comparing it to the libraries listed below
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification. ☆680 Updated 9 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆566 Updated 8 months ago
- Prompt-to-Leaderboard ☆271 Updated 8 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆495 Updated 7 months ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ ☆391 Updated this week
- ☆433 Updated last year
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆663 Updated 10 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆428 Updated 9 months ago
- ☆867 Updated 4 months ago
- Code and data for the Chain-of-Draft (CoD) paper ☆338 Updated 10 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction ☆377 Updated 10 months ago
- ☆1,381 Updated 4 months ago
- 👩‍⚖️ Agent-as-a-Judge: The Magic for Open-Endedness ☆707 Updated 8 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆490 Updated 5 months ago
- The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents" ☆755 Updated 3 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆358 Updated 6 months ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents. ☆812 Updated 8 months ago
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ☆479 Updated 2 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning ☆235 Updated 10 months ago
- AWM: Agent Workflow Memory ☆378 Updated 3 weeks ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆675 Updated 6 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments ☆306 Updated last month
- Pretraining and inference code for a large-scale depth-recurrent language model ☆859 Updated 3 weeks ago
- A-MEM: Agentic Memory for LLM Agents ☆789 Updated last month
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A… ☆961 Updated 7 months ago
- OpenCUA: Open Foundations for Computer-Use Agents ☆636 Updated last week
- An agent benchmark with tasks in a simulated software company. ☆622 Updated 2 months ago
- Integrating Tool Use into LLM Reasoning ☆705 Updated 11 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,534 Updated 7 months ago
- Build your own visual reasoning model ☆417 Updated last week