ulab-uiuc / CS598-Topics-in-LLM-AgentsLinks
Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).
☆30Updated last month
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users that are interested in CS598-Topics-in-LLM-Agents are comparing it to the libraries listed below
Sorting:
- Reproducing R1 for Code with Reliable Rewards☆201Updated last month
- Async pipelined version of Verl☆91Updated last month
- Simple extension on vLLM to help you speed up reasoning model without training.☆152Updated this week
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆73Updated last month
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated last month
- A version of verl to support tool use☆172Updated this week
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆94Updated last month
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆29Updated 3 weeks ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆94Updated 2 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆48Updated 7 months ago
- ☆104Updated last month
- ☆25Updated 3 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆59Updated 6 months ago
- The OlymMATH dataset☆15Updated this week
- r2e: turn any github repository into a programming agent environment☆121Updated last month
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆106Updated 5 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆28Updated 3 months ago
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆375Updated this week
- ☆52Updated last week
- ☆70Updated this week
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆99Updated last week
- Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"