x66ccff / liveideabenchLinks
π€π‘ LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context
β19Updated 2 months ago
Alternatives and similar repositories for liveideabench
Users that are interested in liveideabench are comparing it to the libraries listed below
Sorting:
- Tree-of-Debate converts scientific papers into LLM personas that debate their respective novelties. To emphasize structured, critical reaβ¦β17Updated 6 months ago
- LLM for Scientific Research Surveyβ118Updated last year
- β63Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"β141Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discoveryβ123Updated 5 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590β79Updated 6 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learningβ71Updated 8 months ago
- Official Implementation of the Baby-AIGS systemβ24Updated last year
- Code/data for MARG (multi-agent review generation)β59Updated 4 months ago
- A collection of resources and papers on AI Scientist / Robot Scientistβ120Updated 4 months ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?β101Updated 5 months ago
- β67Updated 10 months ago
- REverse-Engineered Reasoning for Open-Ended Generationβ89Updated 4 months ago
- A curated list of papers on LLMs and agents for scientific research and developmentβ84Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examplesβ114Updated 6 months ago
- Process Reward Models That Thinkβ77Updated 2 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"β84Updated 2 months ago
- β42Updated 8 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award β¦β42Updated last year
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.β52Updated 5 months ago
- β72Updated 7 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generationβ56Updated 4 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?β32Updated 5 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningβ120Updated 8 months ago
- β43Updated 5 months ago
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.β120Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agentβ68Updated 8 months ago
- A trainable user simulatorβ34Updated 7 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"β60Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awarenessβ91Updated 7 months ago