zorazrw / agent-skill-inductionLinks
Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"
☆26Updated 3 months ago
Alternatives and similar repositories for agent-skill-induction
Users that are interested in agent-skill-induction are comparing it to the libraries listed below
Sorting:
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆82Updated 2 months ago
- ☆61Updated 2 weeks ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆103Updated 2 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆140Updated 8 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 7 months ago
- ☆20Updated 4 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆60Updated 6 months ago
- Critique-out-Loud Reward Models☆70Updated 9 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory☆69Updated 2 months ago
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆78Updated 4 months ago
- Natural Language Reinforcement Learning☆92Updated last week
- ☆27Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆59Updated 8 months ago
- WONDERBREAD benchmark + dataset for BPM tasks☆26Updated last week
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆59Updated 8 months ago
- ☆114Updated 6 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆147Updated 9 months ago
- ☆118Updated 5 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆105Updated 3 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆159Updated last month
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 9 months ago
- ☆49Updated 11 months ago
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆22Updated 3 weeks ago
- ☆27Updated 6 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆114Updated 4 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆30Updated last year
- Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".☆39Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆68Updated 3 months ago