ynulihao / AgentSkillOSLinks
Build your agent from 90,000+ skills via skill RETRIEVAL & ORCHESTRATION
☆40Updated this week
Alternatives and similar repositories for AgentSkillOS
Users that are interested in AgentSkillOS are comparing it to the libraries listed below
Sorting:
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆406Updated 3 months ago
- Awesome List for Agentic RL☆738Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆281Updated 11 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆411Updated 2 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 9 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆704Updated 4 months ago
- ☆489Updated 3 months ago
- The official code of ARPO & AEPO☆872Updated 3 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆152Updated 3 months ago
- ☆332Updated 8 months ago
- A version of verl to support diverse tool use☆852Updated 3 weeks ago
- ☆1,482Updated last week
- Official implementation of the NeurIPS 2024 paper CORY☆27Updated last month
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆414Updated 6 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆496Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,184Updated 2 months ago
- ☆19Updated 6 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆601Updated 6 months ago
- llm & rl☆268Updated 3 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,471Updated this week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆688Updated last year
- A series of technical report on Slow Thinking with LLM☆758Updated 5 months ago
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆84Updated 3 weeks ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆62Updated last month
- A RL Framework for multi LLM agent system☆92Updated this week
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,291Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆72Updated 9 months ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆17Updated 11 months ago
- ☆266Updated 5 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆97Updated 6 months ago