ulab-uiuc / CS598-Topics-in-LLM-Agents
Course information for CS598: Topics in LLM Agents (Spring '25), under the direction of Prof. Jiaxuan You (jiaxuan@illinois.edu).
☆41Updated 7 months ago
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users who are interested in CS598-Topics-in-LLM-Agents are comparing it to the repositories listed below.
- Reproducing R1 for Code with Reliable Rewards☆277Updated 7 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆208Updated 5 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆366Updated 3 weeks ago
- Async pipelined version of Verl☆125Updated 8 months ago
- ☆319Updated 6 months ago
- ☆245Updated 4 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆116Updated 4 months ago
- A version of verl to support diverse tool use☆722Updated 2 weeks ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Updated 7 months ago
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆331Updated 3 weeks ago
- ☆70Updated 8 months ago
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks☆254Updated 7 months ago
- A Gym for Agentic LLMs☆395Updated last month
- ☆217Updated 8 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆332Updated 2 months ago
- MCPToolBench++: MCP (Model Context Protocol) Tool Use Benchmark on AI Agent and Model Tool Use Ability☆36Updated 2 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning models without training.☆212Updated 6 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆145Updated last year
- ☆201Updated 4 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆248Updated 7 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,325Updated last week
- Evaluation utilities based on SymPy.☆21Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆213Updated 2 weeks ago
- ☆38Updated 4 months ago
- Awesome List for Agentic RL☆585Updated this week
- ☆302Updated 4 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆270Updated last year
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 7 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆119Updated last year
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 7 months ago