ulab-uiuc / CS598-Topics-in-LLM-Agents
Course information for CS598-Topics in LLM Agents(25'Spring) under the direction of Prof. Jiaxuan You ( jiaxuan@illinois.edu ).
☆23Updated last week
Alternatives and similar repositories for CS598-Topics-in-LLM-Agents:
Users that are interested in CS598-Topics-in-LLM-Agents are comparing it to the libraries listed below
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆56Updated 5 months ago
- Async pipelined version of Verl☆60Updated 2 weeks ago
- ☆97Updated this week
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆64Updated this week
- ☆63Updated 5 months ago
- e☆31Updated this week
- Simple extension on vLLM to help you speed up reasoning model without training.☆146Updated this week
- ☆22Updated 2 months ago
- ☆157Updated last month
- DafnyBench: A Benchmark for Formal Software Verification☆31Updated 4 months ago
- NeurIPS 2024 tutorial on LLM Inference☆42Updated 4 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆95Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆175Updated last month
- Reproducing R1 for Code with Reliable Rewards☆179Updated last week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆80Updated 3 weeks ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆104Updated 4 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆94Updated 2 weeks ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆50Updated 2 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 5 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆86Updated 2 weeks ago
- Critique-out-Loud Reward Models☆62Updated 6 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 5 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆23Updated 2 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆134Updated 5 months ago
- ☆107Updated 3 months ago
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆13Updated 4 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆46Updated 5 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆143Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆166Updated last week