Evaluation utilities based on SymPy.
☆22Dec 12, 2024Updated last year
Alternatives and similar repositories for symeval
Users that are interested in symeval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- ☆12Nov 5, 2024Updated last year
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 7 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆121Dec 10, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆52Mar 5, 2025Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆80Oct 9, 2025Updated 7 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- ☆35Sep 14, 2024Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- ☆14Oct 21, 2024Updated last year
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Mar 4, 2024Updated 2 years ago
- Logging library for C applications☆23Apr 26, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- ☆52Mar 9, 2026Updated 2 months ago
- ☆13Jul 14, 2024Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆126May 6, 2025Updated last year
- ☆13Mar 5, 2025Updated last year
- Modern development with Python in 2024☆12Apr 27, 2026Updated last week
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Jul 9, 2024Updated last year
- inductive reasoning benchmark with subregular hierarchy for string-to-string transformation☆20Jun 27, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Model Context Protocol (MCP) server for Google Calendar integration in Cluade Desktop with auto authentication support. This server ena…☆13Mar 11, 2025Updated last year
- [AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems☆13May 5, 2025Updated last year
- LaTeX Beamer template crafted for University of Illinois Chicago☆11Dec 7, 2024Updated last year
- Experimental online password and secret locker☆20Feb 20, 2024Updated 2 years ago
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated last month
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆59Jul 21, 2025Updated 9 months ago
- This is the unofficial implementation of LEMON (ICLR'2024).☆13Apr 14, 2024Updated 2 years ago
- Implementation of ``Actor-Critic Alignment for Offline-to-Online Reinforcement Learning''☆13Oct 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the Placeholder for Llama. Starting with Llama 3☆11May 20, 2024Updated last year
- ☆1,137Jan 10, 2026Updated 3 months ago
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆64Oct 9, 2024Updated last year
- Easy-to-Hard Learning for Information Extraction (ACL 2023 Findings)☆14Jul 11, 2023Updated 2 years ago
- SemiDefinite Programming Algorithm (SDPA) for Python☆12Jan 27, 2025Updated last year
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆20Aug 15, 2025Updated 8 months ago
- [ACL 2023 Findings] Emergent Modularity in Pre-trained Transformers☆26Jun 7, 2023Updated 2 years ago