Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆31Apr 21, 2025Updated 11 months ago
Alternatives and similar repositories for curie
Users that are interested in curie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository containing dataset, models and code associated with the CHIME project☆17Aug 22, 2024Updated last year
- This website contains the python code accompanying the book "Mathematical Foundations of Deep Learning Models and Algorithms" by Konstant…☆53Nov 24, 2025Updated 4 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 9 months ago
- ☆20Jan 18, 2022Updated 4 years ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆52Nov 12, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆25Jan 11, 2023Updated 3 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- CUDA, CuDNN, NVIDIA Driver, and PyTorch Installation for Ubuntu☆12Feb 27, 2025Updated last year
- An environment for benchmarking commonsense agents☆29Aug 19, 2020Updated 5 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models☆26Jul 13, 2025Updated 9 months ago
- ☆34Apr 1, 2025Updated last year
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆13Jun 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆27Updated this week
- Tools for molecular Docking☆27Jul 24, 2025Updated 8 months ago
- ☆11Nov 28, 2022Updated 3 years ago
- Code for BYOP [CVPR 2023]☆11Sep 25, 2023Updated 2 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 4 months ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated last year
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆76Jan 13, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NEFF Calculator and MSA File Converter☆13Sep 16, 2025Updated 6 months ago
- Codebase for the Graph-based Policy Learning algorithm, which is designed for learning policies to solve the open ad hoc teamwork problem…☆35Mar 31, 2021Updated 5 years ago
- Land use determination and urbanization over time from landsat images☆13Nov 15, 2017Updated 8 years ago
- ☆10Nov 6, 2024Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆183May 14, 2025Updated 11 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆193Sep 13, 2025Updated 7 months ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 4 years ago
- Multimodal RewardBench☆67Feb 21, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- O guia perfeito de como fazer o README.md memorável e decente.☆16Jan 13, 2025Updated last year
- Discovering Data-driven Hypotheses in the Wild☆137Jun 9, 2025Updated 10 months ago
- MASSW is a comprehensive text dataset on Multi-Aspect Summarization of Scientific Workflows. MASSW includes more than 152,000 peer-review…☆21May 16, 2025Updated 10 months ago
- Implementation of "Visual Sentiment Prediction based on Automatic Discovery of Affective Regions"☆13May 23, 2019Updated 6 years ago
- [CVPR CVSPORTS 2025] Official implementation of paper - SoccerNet-v3D: Leveraging Sports Broadcast Replays for 3D Scene Understanding☆43Jun 18, 2025Updated 9 months ago
- ☆13Nov 2, 2024Updated last year
- ☆13Apr 16, 2025Updated 11 months ago