ulab-uiuc / CS598-Topics-in-LLM-AgentsLinks
Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).
☆41Updated 6 months ago
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users that are interested in CS598-Topics-in-LLM-Agents are comparing it to the libraries listed below
Sorting:
- Reproducing R1 for Code with Reliable Rewards☆271Updated 6 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆348Updated this week
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆188Updated 4 months ago
- ☆293Updated 4 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,202Updated last week
- Async pipelined version of Verl☆125Updated 7 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆206Updated 5 months ago
- A Gym for Agentic LLMs☆361Updated last week
- ☆195Updated 3 months ago
- A version of verl to support diverse tool use☆701Updated this week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆323Updated 6 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆248Updated 7 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆250Updated 6 months ago
- Awesome List for Agentic RL☆542Updated 2 weeks ago
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆312Updated last week
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆264Updated last year
- ☆309Updated 5 months ago
- ☆67Updated 7 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆114Updated 3 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆318Updated 2 months ago
- ☆215Updated 7 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 7 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆116Updated 11 months ago
- ☆241Updated last year
- ☆231Updated 3 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆127Updated 3 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.☆290Updated last month
- ☆54Updated 2 years ago
- ☆34Updated 9 months ago
- (ACL 2025 Main) Code for MultiAgentBench : Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆185Updated 3 weeks ago