ZonglinY / MOOSE
[ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award in ICML 2024 AI4Science workshop.
☆36Updated 2 months ago
Alternatives and similar repositories for MOOSE:
Users that are interested in MOOSE are comparing it to the libraries listed below
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆35Updated 2 weeks ago
- Evaluate the Quality of Critique☆35Updated 7 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆68Updated last month
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆70Updated 9 months ago
- ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆33Updated this week
- ☆110Updated 6 months ago
- ☆20Updated this week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆53Updated 10 months ago
- ☆37Updated 3 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆32Updated 5 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 3 months ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆29Updated last month
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- ☆19Updated 3 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆65Updated 2 weeks ago
- Replicating O1 inference-time scaling laws☆70Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆57Updated 2 weeks ago
- ☆14Updated 3 months ago
- This repository contains ScholarQABench data and evaluation pipeline.☆51Updated last month
- ☆20Updated 7 months ago
- ☆36Updated 5 months ago
- ☆64Updated 11 months ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆31Updated 6 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆80Updated 10 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆93Updated 11 months ago
- ☆26Updated 6 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆30Updated last month