ulab-uiuc / CS598-Topics-in-LLM-AgentsLinks
Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).
☆43Updated 8 months ago
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users that are interested in CS598-Topics-in-LLM-Agents are comparing it to the libraries listed below
Sorting:
- Reproducing R1 for Code with Reliable Rewards☆279Updated 8 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆386Updated last month
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆224Updated 6 months ago
- Async pipelined version of Verl☆124Updated 9 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆115Updated 5 months ago
- A Gym for Agentic LLMs☆420Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,437Updated this week
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆217Updated 7 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆330Updated 8 months ago
- A version of verl to support diverse tool use☆805Updated this week
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆270Updated last year
- ☆255Updated 5 months ago
- ☆326Updated 7 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆254Updated 8 months ago
- ☆39Updated 5 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆150Updated last year
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆354Updated last month
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆181Updated this week
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆249Updated 8 months ago
- Evaluation utilities based on SymPy.☆21Updated last year
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆141Updated 2 weeks ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆348Updated 3 months ago
- ☆686Updated this week
- NexRL is an ultra-loosely-coupled LLM post-training framework.☆63Updated last month
- [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation☆245Updated last year
- A Comprehensive Benchmark for Software Development.☆124Updated last year
- Awesome List for Agentic RL☆706Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆215Updated last month
- ☆217Updated last week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 7 months ago