qixucen / atom
☆514Updated last week
Alternatives and similar repositories for atom:
Users that are interested in atom are comparing it to the libraries listed below
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆542Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆735Updated 3 weeks ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆474Updated last week
- ☆438Updated 5 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆1,389Updated this week
- Verifiers for LLM Reinforcement Learning☆686Updated this week
- free and open OpenAI Deep Research☆480Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,065Updated this week
- An agent benchmark with tasks in a simulated software company.☆273Updated last week
- 🤠 Agent-as-a-Judge and DevAI dataset☆384Updated 2 months ago
- Build your own visual reasoning model☆312Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆425Updated 6 months ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆487Updated 3 weeks ago
- Pretraining code for a large-scale depth-recurrent language model☆697Updated last week
- LIMO: Less is More for Reasoning☆864Updated last month
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆1,210Updated this week
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆191Updated last month
- ☆485Updated last week
- procedural reasoning datasets☆534Updated this week
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆451Updated this week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆330Updated last month
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆403Updated 2 weeks ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆479Updated last month
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆480Updated last month
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects.☆299Updated 2 weeks ago
- OpenResearcher, an advanced Scientific Research Assistant☆438Updated 5 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆568Updated this week
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆268Updated 3 weeks ago
- Recipes to scale inference-time compute of open models☆1,044Updated last month
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates☆353Updated this week