google / curieLinks
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆23Updated last month
Alternatives and similar repositories for curie
Users that are interested in curie are comparing it to the libraries listed below
Sorting:
- ☆32Updated 5 months ago
- implementation of dualformer☆17Updated 3 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆40Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆18Updated 3 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 3 months ago
- MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion (ACL 2025)☆24Updated 2 weeks ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning☆53Updated 2 months ago
- ☆22Updated 3 weeks ago
- A collection of resources and papers on AI Scientist / Robot Scientist☆69Updated this week
- ☆21Updated 8 months ago
- Exploration of automated dataset selection approaches at large scales.☆42Updated 3 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆27Updated last week
- Official implementation for the paper "Can Large Reasoning Models Self-Train?"☆32Updated last week
- ☆72Updated last month
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆44Updated 6 months ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆42Updated this week
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆12Updated 2 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆40Updated this week
- A repository for research on medium sized language models.☆76Updated last year
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"☆60Updated 3 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆42Updated last week
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 2 months ago
- Make reasoning models scalable☆32Updated last week
- Official Repository of LatentSeek☆30Updated 2 weeks ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆84Updated 8 months ago
- Process Reward Models That Think☆38Updated last week
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆41Updated 7 months ago
- Call for participation in the impact of LLM for scientific discovery☆72Updated last year