ulab-uiuc / CS598-Topics-in-LLM-Agents
Course information for CS598: Topics in LLM Agents (Spring 2025), under the direction of Prof. Jiaxuan You (jiaxuan@illinois.edu).
☆31 · Updated last month
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users who are interested in CS598-Topics-in-LLM-Agents are comparing it to the repositories listed below.
- Simple extension on vLLM to help you speed up reasoning model without training. ☆161 · Updated 3 weeks ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆140 · Updated 2 weeks ago
- Reproducing R1 for Code with Reliable Rewards ☆221 · Updated last month
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆77 · Updated 2 weeks ago
- ☆127 · Updated 2 weeks ago
- Repo of paper "Free Process Rewards without Process Labels" ☆153 · Updated 3 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆108 · Updated 6 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆94 · Updated 3 months ago
- A version of verl to support tool use ☆261 · Updated this week
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton ☆28 · Updated 4 months ago
- The OlymMATH dataset ☆16 · Updated 3 weeks ago
- RL Scaling and Test-Time Scaling (ICML'25) ☆106 · Updated 5 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆48 · Updated 7 months ago
- ☆190 · Updated 3 months ago
- ☆28 · Updated 4 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆48 · Updated 7 months ago
- Evaluation utilities based on SymPy. ☆20 · Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆82 · Updated 4 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆99 · Updated last month
- Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? ☆29 · Updated 3 weeks ago
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning ☆37 · Updated 2 weeks ago
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models ☆107 · Updated 2 months ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning ☆422 · Updated last week
- Async pipelined version of Verl ☆100 · Updated 2 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System" ☆59 · Updated 7 months ago
- ☆71 · Updated 7 months ago
- ☆65 · Updated 2 months ago
- ☆220 · Updated last month
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset. ☆17 · Updated 2 months ago
- ☆172 · Updated this week