ADaM-BJTU / O1-CODER
AN O1 REPLICATION FOR CODING
☆222 Updated last week
Alternatives and similar repositories for O1-CODER:
Users who are interested in O1-CODER are comparing it to the repositories listed below.
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆255 Updated this week
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e… ☆366 Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024) ☆397 Updated 2 months ago
- ☆586 Updated 2 weeks ago
- ☆320 Updated 6 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning ☆186 Updated 2 months ago
- ☆174 Updated 3 weeks ago
- connecting humans and agents ☆63 Updated last week
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆200 Updated 3 weeks ago
- Large Reasoning Models ☆718 Updated 2 weeks ago
- ☆230 Updated 4 months ago
- ☆998 Updated 3 weeks ago
- ☆289 Updated 2 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale" ☆201 Updated 2 months ago
- AWM: Agent Workflow Memory ☆218 Updated 3 weeks ago
- This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects. ☆135 Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents ☆260 Updated 6 months ago
- FireAct: Toward Language Agent Fine-tuning ☆259 Updated last year
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent ☆73 Updated 2 weeks ago
- Towards Large Multimodal Models as Visual Foundation Agents ☆142 Updated 3 weeks ago
- ☆97 Updated 4 months ago
- Expert Specialized Fine-Tuning ☆150 Updated 2 months ago
- [ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step ☆244 Updated 8 months ago
- An implementation of Everything of Thoughts (XoT). ☆135 Updated 9 months ago
- ☆78 Updated 3 weeks ago
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen… ☆219 Updated 4 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆173 Updated 2 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆151 Updated last week
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc. ☆166 Updated this week
- Generative Judge for Evaluating Alignment ☆220 Updated 11 months ago