google / curieLinks
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆27Updated 5 months ago
Alternatives and similar repositories for curie
Users that are interested in curie are comparing it to the libraries listed below
Sorting:
- A testbed for agents and environments that can automatically improve models through data generation.☆27Updated 6 months ago
- implementation of dualformer☆20Updated 6 months ago
- A collection of resources and papers on AI Scientist / Robot Scientist☆95Updated 3 months ago
- SSRL: Self-Search Reinforcement Learning☆144Updated last month
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆64Updated 8 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆77Updated 7 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆68Updated last month
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆100Updated 2 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆103Updated last month
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆16Updated 5 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆95Updated last month
- Structured Chemistry Reasoning with Large Language Models☆38Updated last year
- TraceRL - Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆206Updated this week
- ☆28Updated 4 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆20Updated 6 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025)☆29Updated 2 months ago
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025]☆76Updated 5 months ago
- ☆35Updated 4 months ago
- Esoteric Language Models☆99Updated 2 months ago
- Official Implementation of the Baby-AIGS system☆23Updated 10 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆54Updated 10 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆70Updated 4 months ago
- ☆48Updated 7 months ago
- [ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models☆75Updated last month
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆40Updated 2 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆13Updated 2 months ago
- ☆215Updated 7 months ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆43Updated last week
- Official implementation of "BERTs are Generative In-Context Learners"☆32Updated 6 months ago
- Call for participation in the impact of LLM for scientific discovery☆73Updated last year