[ICML 2025] ResearchTown: Simulator of Human Research Community
☆196Mar 3, 2026Updated this week
Alternatives and similar repositories for research-town
Users who are interested in research-town are comparing it to the repositories listed below
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Dec 30, 2025Updated 2 months ago
- Create awesome games with GPT☆32Mar 21, 2023Updated 2 years ago
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated last month
- [ICML 2025] "Graph World Model", Tao Feng, Yexin Wu, Guanyu Lin, Jiaxuan You☆30Sep 20, 2025Updated 5 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Dec 19, 2024Updated last year
- [ACL'25 Main] Graph of Records: Boosting Retrieval Augmented Generation for Long-context Summarization with Graphs☆39May 26, 2025Updated 9 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆50Dec 25, 2025Updated 2 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆37Oct 7, 2025Updated 5 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- [IEEE VL/HCC'25] Frontend Diffusion is an end-to-end LLM-powered tool that generates high-quality websites from user sketches.☆19Oct 10, 2025Updated 4 months ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- ☆16Jul 23, 2024Updated last year
- (ACL-2025 main conference) Dolphin: Moving Towards Closed-loop Auto-research through Thinking, Practice, and Feedback☆39Jun 24, 2025Updated 8 months ago
- GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators☆47Dec 23, 2025Updated 2 months ago
- Graph Transformers for Large Graphs☆22Apr 26, 2024Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆281Jan 23, 2026Updated last month
- [ICLR 2025] "GraphRouter: A Graph-based Router for LLM Selections", Tao Feng, Yanzhen Shen, Jiaxuan You☆61Dec 30, 2025Updated 2 months ago
- ☆72Updated this week
- LLM for Scientific Research Survey☆126Jan 22, 2025Updated last year
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆64Jul 22, 2025Updated 7 months ago
- ☆63Sep 18, 2025Updated 5 months ago
- ☆67Mar 30, 2025Updated 11 months ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆27Jul 23, 2025Updated 7 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- Learning records for building a large language model from scratch☆59Jan 1, 2025Updated last year
- ☆25Aug 19, 2025Updated 6 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- ☆29Nov 9, 2025Updated 4 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 9 months ago
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆53May 21, 2025Updated 9 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 8 months ago
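- An open-source mathematical large-model project that aims to explore whether large models have mathematical creativity, and what potential they have in frontier mathematical research.☆17May 16, 2025Updated 9 months ago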
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆15Dec 30, 2024Updated last year
- ☆11Dec 16, 2024Updated last year
- Detecting Drift in a Diabetes Dataset using Taipy☆12May 19, 2025Updated 9 months ago
- ☆10Jul 30, 2023Updated 2 years ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆644Jul 29, 2025Updated 7 months ago
- ☆49Jul 22, 2024Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago