ulab-uiuc / research-town
A platform for developers to simulate collaborative research activities
☆141Updated this week
Alternatives and similar repositories for research-town:
Users that are interested in research-town are comparing it to the libraries listed below
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆73Updated 2 months ago
- ☆291Updated 3 months ago
- A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…☆249Updated 3 weeks ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆52Updated 4 months ago
- ☆91Updated 3 months ago
- A banchmark list for evaluation of large language models.☆87Updated last week
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆136Updated 10 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆229Updated last month
- ☆217Updated 7 months ago
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆107Updated 6 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆76Updated 2 weeks ago
- ☆102Updated 3 months ago
- connecting humans and agents☆78Updated 3 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆69Updated 3 weeks ago
- ☆41Updated 5 months ago
- A simple unified framework for evaluating LLMs☆204Updated 2 weeks ago
- ☆143Updated 3 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆61Updated 10 months ago
- A brief and partial summary of RLHF algorithms.☆124Updated 2 weeks ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆179Updated 7 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆62Updated last month