GAIR-NLP / SR-ScientistLinks
SR-Scientist: Scientific Equation Discovery With Agentic AI
☆29Updated 2 months ago
Alternatives and similar repositories for SR-Scientist
Users that are interested in SR-Scientist are comparing it to the libraries listed below
Sorting:
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 6 months ago
- Data Synthesis for Deep Research Based on Semi-Structured Data☆191Updated 3 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆87Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 7 months ago
- ☆67Updated 9 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆119Updated 7 months ago
- LIMI: Less is More for Agency☆156Updated 2 months ago
- ☆74Updated 3 months ago
- ☆39Updated last year
- ☆32Updated last year
- ☆41Updated 7 months ago
- ☆63Updated 6 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 5 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆59Updated 10 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆227Updated 3 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆35Updated 3 months ago
- ☆23Updated last year
- SSRL: Self-Search Reinforcement Learning☆201Updated 4 months ago
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆95Updated 2 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆140Updated last year
- ☆63Updated last year
- ☆93Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆85Updated 9 months ago
- The official repo for the code and data of paper SMART☆37Updated 10 months ago
- ☆37Updated 2 months ago
- HaluMem is the first operation level hallucination evaluation benchmark tailored to agent memory systems.☆98Updated this week
- ☆105Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆250Updated this week
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆197Updated last month