tongyx361/symeval

Evaluation utilities based on SymPy.

☆22

Alternatives and similar repositories for symeval

Users who are interested in symeval are comparing it to the libraries listed below.

RUCAIBox / JiuZhang3.0
The code and data for the paper JiuZhang3.0
☆49 · May 26, 2024 · Updated 2 years ago
KbsdJames / omni-math-rule
The rule-based evaluation subset and code implementation of Omni-MATH
☆28 · Dec 23, 2024 · Updated last year
zhuzilin / vllm-group
☆12 · Nov 5, 2024 · Updated last year
iiis-ai / IterativeQuestionComposing
[AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)
☆23 · Oct 2, 2025 · Updated 9 months ago
hkust-nlp / dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120 · Dec 10, 2024 · Updated last year
GAIR-NLP / ReasonEval
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80 · Oct 9, 2025 · Updated 9 months ago
RUCAIBox / CIR
☆16 · Nov 11, 2025 · Updated 8 months ago
THUDM / SWE-Dev
[ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.
☆64 · Jul 21, 2025 · Updated last year
simveit / effective_transpose
Effective transpose on Hopper GPU
☆29 · Sep 6, 2025 · Updated 10 months ago
swtheing / PF-PPO-RLHF
☆34 · Sep 14, 2024 · Updated last year
koalazf99 / nanoverl
Collection of RLxLM experiments using minimal code
☆14 · Feb 17, 2025 · Updated last year
cooelf / dive-into-llms
Dive-into-LLMs Tutorial for Beginners
☆27 · May 14, 2024 · Updated 2 years ago
babelouest / angharad
Personal house automation system with a REST/JSON interface
☆18 · Feb 20, 2024 · Updated 2 years ago
tqch / poisson-jump
Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)
☆10 · Jun 6, 2023 · Updated 3 years ago
eesast / THUAI6
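Tsinghua University 6th Artificial Intelligence Challenge, Department of Electronic Engineering track (formerly the department's 24th Team Programming Contest, teamstyle24)
☆29 · May 11, 2024 · Updated 2 years ago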
gao-xiao-bai / JsonTuning
JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning
☆10 · Nov 3, 2024 · Updated last year
babelouest / yder
Logging library for C applications
☆23 · Apr 26, 2026 · Updated 3 months ago
albertqjiang / MMA
The official repository for the paper Multilingual Mathematical Autoformalization
☆39 · May 20, 2024 · Updated 2 years ago
SHUMKASHUN / Plots
This repo contains my custom-styled, Python-based plots for NLP papers, and includes my reproductions of plots from my favourite papers
☆39 · Mar 4, 2024 · Updated 2 years ago
SWE-Gym / SWE-Bench-Fork
☆13 · Mar 5, 2025 · Updated last year
GAIR-NLP / self-improvement-reversal
☆13 · Jul 14, 2024 · Updated 2 years ago
druidowm / OccamLLM
☆14 · Oct 21, 2024 · Updated last year
jellydn / modern-python-2024-demo
Modern development with Python in 2024
☆12 · Updated this week
behrouz-rfa / mongo-specification
☆16 · Apr 23, 2023 · Updated 3 years ago
princeton-nlp / PTP
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
☆32 · Jul 9, 2024 · Updated 2 years ago
GongRzhe / Calendar-Autoauth-MCP-Server
A Model Context Protocol (MCP) server for Google Calendar integration in Claude Desktop with auto authentication support. This server ena…
☆12 · Mar 11, 2025 · Updated last year
hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆94 · Dec 1, 2024 · Updated last year
sdpa-python / sdpa-python
SemiDefinite Programming Algorithm (SDPA) for Python
☆12 · Jul 1, 2026 · Updated 3 weeks ago
JunyiYe / CreativeMath
[AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems
☆13 · May 5, 2025 · Updated last year
ZishunYu / Actor-Critic-Alignment
Implementation of "Actor-Critic Alignment for Offline-to-Online Reinforcement Learning"
☆13 · Oct 12, 2023 · Updated 2 years ago
hkust-nlp / model-task-align-rl
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18 · Feb 9, 2026 · Updated 5 months ago
DAMO-NLP-SG / IE-E2H
Easy-to-Hard Learning for Information Extraction (ACL 2023 Findings)
☆14 · Jul 11, 2023 · Updated 3 years ago
ByungKwanLee / Phantom
[Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …
☆63 · Oct 9, 2024 · Updated last year
Nexusphobiker / MHWSaveEditor
Work-in-progress save editor for Monster Hunter: World
☆11 · Aug 15, 2018 · Updated 7 years ago
kh4nh12 / self_study_ds
Top Picks for Data Science Self-Study: From Newbies to Pros!
☆11 · Apr 2, 2024 · Updated 2 years ago
QwenLM / PolyMath
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆43 · May 22, 2025 · Updated last year
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189 · Jul 23, 2025 · Updated last year
ccibeekeoc42 / Meta_Llama
This is a placeholder for Llama, starting with Llama 3
☆11 · May 20, 2024 · Updated 2 years ago
HKUNLP / critic-rl
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127 · May 6, 2025 · Updated last year