ulab-uiuc / research-town
[ICML 2025] A platform for developers to simulate collaborative research activities
☆154Updated this week
Alternatives and similar repositories for research-town
Users that are interested in research-town are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆92Updated 2 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆188Updated last week
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆119Updated 7 months ago
- ☆311Updated 5 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆75Updated 4 months ago
- ☆55Updated last month
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆85Updated 2 weeks ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆85Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆130Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆144Updated 3 weeks ago
- A brief and partial summary of RLHF algorithms.☆128Updated 2 months ago
- 🤝 The code for "Can Large Language Model Agents Simulate Human Trust Behaviors?"☆81Updated last month
- ☆153Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- A curated paper list on LLM reasoning.☆87Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆76Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆198Updated last week
- LLM for Scientific Research Survey☆84Updated 3 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆51Updated 2 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆58Updated last month
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆265Updated 2 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆186Updated 9 months ago
- ☆42Updated 6 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆96Updated 6 months ago
- ☆176Updated 2 weeks ago
- A Comprehensive Survey on Long Context Language Modeling☆139Updated last month
- An Open Math Pre-trainng Dataset with 370B Tokens.☆80Updated last month
- [ICML'25] Multi-agent Architecture Search via Agentic Supernet☆52Updated last week
- "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆73Updated last month