google / curie
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆21 Updated 3 weeks ago
Alternatives and similar repositories for curie
Users who are interested in curie are comparing it to the repositories listed below
- Structured Chemistry Reasoning with Large Language Models ☆38 Updated last year
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 2 months ago
- ☆50 Updated 2 months ago
- Implementation of Dualformer ☆17 Updated 2 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆40 Updated 6 months ago
- PyTorch code for the paper "An Empirical Study of Multimodal Model Merging" ☆38 Updated last year
- ☆31 Updated 4 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆18 Updated 2 months ago
- ☆21 Updated 7 months ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension ☆44 Updated 5 months ago
- ☆27 Updated this week
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023 ☆20 Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆39 Updated 7 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability ☆38 Updated 3 weeks ago
- A repository for research on medium sized language models. ☆76 Updated 11 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024] ☆58 Updated 4 months ago
- Code for the paper "Larger and more instructable language models become less reliable" ☆29 Updated 7 months ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models ☆43 Updated 2 weeks ago
- ☆38 Updated last month
- Exploration of automated dataset selection approaches at large scales. ☆40 Updated 2 months ago
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆55 Updated 3 months ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It. ☆16 Updated last month
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses> ☆41 Updated 2 weeks ago
- Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding ☆29 Updated 2 weeks ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆94 Updated last month
- Official Code for Paper "Think While You Generate: Discrete Diffusion with Planned Denoising" [ICLR 2025] ☆58 Updated 3 weeks ago
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach" ☆14 Updated 8 months ago
- Process Reward Models That Think ☆32 Updated this week
- Call for participation in the impact of LLM for scientific discovery ☆69 Updated last year
- ☆26 Updated last month