google / curie
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆26 Updated 4 months ago
Alternatives and similar repositories for curie
Users that are interested in curie are comparing it to the libraries listed below
- implementation of dualformer ☆20 Updated 6 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning ☆93 Updated last month
- Esoteric Language Models ☆94 Updated last month
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆86 Updated this week
- [ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590 ☆68 Updated last month
- A collection of resources and papers on AI Scientist / Robot Scientist ☆91 Updated 2 months ago
- SSRL: Self-Search Reinforcement Learning ☆93 Updated last week
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆98 Updated last week
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024] ☆64 Updated 7 months ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆17 Updated 5 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆140 Updated this week
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆73 Updated 6 months ago
- ☆98 Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆86 Updated 11 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment ☆58 Updated last year
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study ☆53 Updated 9 months ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆45 Updated 8 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆41 Updated 10 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning ☆12 Updated 2 months ago
- ☆27 Updated 3 months ago
- ☆213 Updated 6 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆65 Updated 3 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling ☆38 Updated 5 months ago
- Official Jax Implementation of MD4 Masked Diffusion Models ☆123 Updated 6 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 5 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆82 Updated 10 months ago
- MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion (ACL 2025) ☆29 Updated last month
- ☆87 Updated last month
- Call for participation in the impact of LLM for scientific discovery ☆73 Updated last year
- The official github repo for "Diffusion Language Models are Super Data Learners". ☆107 Updated 3 weeks ago