amair-lab / PiFlowLinks
[preprint] PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration
☆39Updated last month
Alternatives and similar repositories for PiFlow
Users that are interested in PiFlow are comparing it to the libraries listed below
Sorting:
- A collection of resources and papers on AI Scientist / Robot Scientist☆124Updated 4 months ago
- A curated list of papers on LLMs and agents for scientific research and development☆85Updated last year
- (ICLR 2026) Optimas: Optimizing Compound AI Systems☆68Updated this week
- Structured Chemistry Reasoning with Large Language Models☆39Updated last year
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆80Updated 6 months ago
- When Reasoning Meets Its Laws☆35Updated last month
- ☆67Updated 10 months ago
- ☆25Updated 2 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆112Updated last week
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆124Updated 5 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Updated 5 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated last year
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme…☆19Updated last year
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆73Updated 7 months ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆29Updated 9 months ago
- ☆56Updated 2 weeks ago
- Training Proactive and Personalized LLM Agents☆98Updated 3 weeks ago
- ☆43Updated 8 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆50Updated 3 weeks ago
- A RL env with procedurally generated symbolic reasoning data☆33Updated last week
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆52Updated 2 months ago
- ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization☆95Updated 8 months ago
- Official Repository of "GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration".☆43Updated 10 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆90Updated last year
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆25Updated 6 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆41Updated 5 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 6 months ago
- ☆19Updated 10 months ago
- Code for the paper "Larger and more instructable language models become less reliable"☆31Updated last year