ulab-uiuc / CS598-Topics-in-LLM-Agents
Course information for CS598: Topics in LLM Agents (Spring '25), under the direction of Prof. Jiaxuan You (jiaxuan@illinois.edu).
☆32 Updated 3 months ago
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents
Users that are interested in CS598-Topics-in-LLM-Agents are comparing it to the libraries listed below
- Reproducing R1 for Code with Reliable Rewards ☆246 Updated 3 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs ☆698 Updated this week
- A version of verl to support tool use ☆315 Updated this week
- ☆263 Updated 2 months ago
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆142 Updated 3 weeks ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆185 Updated this week
- Async pipelined version of Verl ☆112 Updated 4 months ago
- Paper list for Efficient Reasoning. ☆586 Updated this week
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collected directly from official code bases. ☆243 Updated this week
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆232 Updated this week
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior. ☆245 Updated 3 months ago
- ☆250 Updated 2 weeks ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆234 Updated this week
- Simple extension on vLLM to help you speed up reasoning models without training. ☆174 Updated 2 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation ☆307 Updated 3 months ago
- ☆174 Updated 3 months ago
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆233 Updated 3 months ago
- Building a comprehensive and handy list of papers for GUI agents ☆456 Updated last month
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders" ☆99 Updated 3 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆117 Updated 5 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆237 Updated 2 months ago
- ☆65 Updated 3 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆100 Updated this week
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models". ☆80 Updated last year
- A Comprehensive Survey on Long Context Language Modeling ☆170 Updated last month
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources. ☆243 Updated 3 weeks ago
- A Comprehensive Benchmark for Software Development. ☆111 Updated last year
- A benchmark list for evaluation of large language models. ☆134 Updated last month
- Scaling Data for SWE-agents ☆342 Updated last week
- ☆237 Updated 11 months ago