google / curie
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆29 Updated 9 months ago
Alternatives and similar repositories for curie
Users who are interested in curie are comparing it to the repositories listed below.
- [ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590 ☆79 Updated 6 months ago
- Defeating the Training-Inference Mismatch via FP16 ☆180 Updated 2 months ago
- Structured Chemistry Reasoning with Large Language Models ☆39 Updated last year
- Esoteric Language Models ☆109 Updated 2 months ago
- Implementation of dualformer ☆24 Updated 11 months ago
- A collection of resources and papers on AI Scientist / Robot Scientist ☆120 Updated 4 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆119 Updated 3 weeks ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge ☆94 Updated 2 weeks ago
- Optimize Any User-defined Compound AI Systems ☆66 Updated 5 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆28 Updated 10 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024] ☆71 Updated last year
- ☆19 Updated 6 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆42 Updated last year
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs ☆51 Updated last month
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆86 Updated 11 months ago
- Reproduce ICLR2025 Energy-Based Diffusion Language Models for Text Generation ☆58 Updated 6 months ago
- ☆37 Updated 8 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents ☆124 Updated last week
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses> ☆50 Updated 2 months ago
- UQ: Assessing Language Models on Unsolved Questions ☆30 Updated 5 months ago
- A curated list of papers on LLMs and agents for scientific research and development ☆84 Updated last year
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning ☆111 Updated 2 months ago
- ☆42 Updated 8 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆84 Updated 9 months ago
- ☆89 Updated 3 months ago
- ☆108 Updated last year
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning". ☆29 Updated 4 months ago
- ☆11 Updated 2 years ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆14 Updated 7 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆123 Updated 5 months ago