The OlymMATH dataset
☆24Jun 1, 2025Updated 10 months ago
Alternatives and similar repositories for OlymMATH
Users who are interested in OlymMATH are comparing it to the libraries listed below.
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 8 months ago
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…☆28Feb 14, 2025Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- Codebase for Math Neurosurgery: Isolating LLMs' Math Reasoning Abilities Using Only Forward Passes☆21Jun 15, 2025Updated 10 months ago
- NeurIPS 2025: Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆53Mar 14, 2026Updated last month
- Survey of small language models☆18Jul 23, 2024Updated last year
- The benchmark proposed in paper: GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability☆25Aug 12, 2025Updated 8 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated 9 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- ☆29Jan 23, 2024Updated 2 years ago
- A series of technical reports on Slow Thinking with LLM☆764Aug 13, 2025Updated 8 months ago
- The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated 2 months ago
- ☆33May 27, 2025Updated 10 months ago
- ☆14Oct 21, 2024Updated last year
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆40May 30, 2025Updated 10 months ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆38Feb 4, 2026Updated 2 months ago
- Modern development with Python in 2024☆12Updated this week
- ☆19Aug 4, 2025Updated 8 months ago
- "GraphArena: Evaluating and Exploring Large Language Models on Graph Computation" in ICLR 2025☆34Mar 2, 2025Updated last year
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Jul 25, 2024Updated last year
- Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific a…☆18Dec 25, 2024Updated last year
- ☆63Jun 12, 2025Updated 10 months ago
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated 11 months ago
- GitHub repository for the open-source benchmark AdvancedIF, see LAMA L1387358RCRO☆31Nov 26, 2025Updated 4 months ago
- LaTeX Beamer template crafted for University of Illinois Chicago☆11Dec 7, 2024Updated last year
- ☆16Jun 14, 2023Updated 2 years ago
- [ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…☆68Oct 27, 2024Updated last year
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated last month
- ☆16May 22, 2025Updated 10 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆224Jul 25, 2025Updated 8 months ago
- ☆15Aug 25, 2021Updated 4 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- ☆27Apr 14, 2025Updated last year
- Direct preference optimization with f-divergences.☆16Nov 3, 2024Updated last year
- Jupyter notebooks to fine-tune Whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆20Aug 15, 2025Updated 8 months ago
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆191Jun 8, 2025Updated 10 months ago
- ☆13Aug 27, 2019Updated 6 years ago
- Top Picks for Data Science Self-Study: From Newbies to Pros!☆11Apr 2, 2024Updated 2 years ago