Evaluation utilities based on SymPy.
☆22Dec 12, 2024Updated last year
Alternatives and similar repositories for symeval
Users that are interested in symeval are comparing it to the libraries listed below.
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- ☆12Nov 5, 2024Updated last year
- Dive-into-LLMs Tutorial for Beginners☆16May 14, 2024Updated last year
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 6 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆121Dec 10, 2024Updated last year
- ☆52Mar 5, 2025Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆78Oct 9, 2025Updated 6 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 10 months ago
- Personal house automation system with a REST/Json interface☆18Feb 20, 2024Updated 2 years ago
- Course Rusher for BJUT☆14Dec 28, 2017Updated 8 years ago
- Official implementation of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆23Jun 17, 2025Updated 10 months ago
- (ICML 2025) Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning☆10Jun 19, 2025Updated 10 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- Potluck with different functions for different purposes that can be shared among C programs☆13Mar 4, 2024Updated 2 years ago
- ☆35Sep 14, 2024Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- 清华大学第六届人工智能挑战赛电子系赛道(原电子系第 24 届队式程序设计大赛 teamstyle24)☆28May 11, 2024Updated last year
- ☆14Oct 21, 2024Updated last year
- This repo contains my customised style python based plots for NLP papers, and includes my reproduction for my favourite papers' plots☆39Mar 4, 2024Updated 2 years ago
- Logging library for C applications☆23Mar 4, 2024Updated 2 years ago
- ☆13Jul 14, 2024Updated last year
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆125May 6, 2025Updated 11 months ago
- ☆13Mar 5, 2025Updated last year
- Modern development with Python in 2024☆12Updated this week
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Jul 9, 2024Updated last year
- inductive reasoning benchmark with subregular hierarchy for string-to-string transformation☆20Jun 27, 2025Updated 9 months ago
- A Model Context Protocol (MCP) server for Google Calendar integration in Claude Desktop with auto authentication support. This server ena…☆13Mar 11, 2025Updated last year
- Experimental online password and secret locker☆20Feb 20, 2024Updated 2 years ago
- Replicating O1 inference-time scaling laws☆93Dec 1, 2024Updated last year
- Convert MathML to Latex for OneNote to Markdown☆13Mar 17, 2026Updated last month
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆59Jul 21, 2025Updated 8 months ago
- Source code for LearnQtGuide's Threading and IPC with Qt C++ Course☆17Nov 11, 2019Updated 6 years ago
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.☆59Nov 24, 2025Updated 4 months ago
- This is the Placeholder for Llama. Starting with Llama 3☆11May 20, 2024Updated last year
- ☆1,129Jan 10, 2026Updated 3 months ago
- Easy-to-Hard Learning for Information Extraction (ACL 2023 Findings)☆14Jul 11, 2023Updated 2 years ago