Zhenwen-NLP/MathChat

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Zhenwen-NLP/MathChat)

Zhenwen-NLP / MathChat

Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

☆22

Alternatives and similar repositories for MathChat

Users that are interested in MathChat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MikeWangWZHL / Zemi
View on GitHub
Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings
☆15May 3, 2023Updated 3 years ago
yudiandoris / csi
View on GitHub
End-to-End Chinese Speaker Identification
☆11Nov 17, 2022Updated 3 years ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
sail-sg / ActivePRM
View on GitHub
☆21Apr 16, 2025Updated last year
zhuzilin / vllm-group
View on GitHub
☆12Nov 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
psunlpgroup / FoVer
View on GitHub
This repository includes code and materials for the paper "Efficient PRM Training Data Synthesis via Formal Verification" (ACL 2026 Findi…
☆18Apr 7, 2026Updated 3 months ago
manjunath5496 / Mathematics-Tutorial
View on GitHub
"A relativist is an individual who doesn't know the difference between an adjective and an adverb." ― Bill Gaede
☆24Dec 3, 2020Updated 5 years ago
Alab-NII / mrc-heuristics
View on GitHub
☆18Nov 17, 2018Updated 7 years ago
kaushal0494 / UnifyingAITutorEvaluation
View on GitHub
An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
☆29Mar 2, 2026Updated 4 months ago
yyDing1 / ScaleQuest
View on GitHub
[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLM…
☆69Oct 27, 2024Updated last year
google / curie
View on GitHub
Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025
☆34Apr 21, 2025Updated last year
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
umass-ml4ed / dialogue-kt
View on GitHub
Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.
☆38Feb 12, 2025Updated last year
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KbsdJames / Omni-MATH
View on GitHub
The official repository of the Omni-MATH benchmark.
☆94Dec 22, 2024Updated last year
kdu4108 / semiring-backprop-exps
View on GitHub
☆16Jul 10, 2023Updated 3 years ago
bammt / Learn-to-check
View on GitHub
the datasets of our paper
☆11Feb 26, 2024Updated 2 years ago
YuxiangChai / OpenSlides
View on GitHub
AI-powered slide workspace for creating, editing, versioning, and presenting beautiful reveal.js decks from prompts and source files.
☆15Apr 14, 2026Updated 3 months ago
ablghtianyi / ICL_Modular_Arithmetic
View on GitHub
☆19Mar 25, 2025Updated last year
majianz / dl4gps
View on GitHub
[ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"
☆36Sep 14, 2025Updated 10 months ago
hccngu / DialCoT
View on GitHub
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models
☆13Nov 2, 2023Updated 2 years ago
Eydcao / Yan-CG-SnowSim
View on GitHub
My final project, Snow Simulation, for Prof. Lingqi Yan's online open course games 101-Intro to Modern Computer Graphics
☆12Mar 12, 2021Updated 5 years ago
QipengGuo / GraphWriter-DGL
View on GitHub
☆13Nov 13, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
doeun-235 / Cucker-Smale-Model
View on GitHub
Works about Cucker-Smale model and its extensions. =Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib
☆12Feb 14, 2024Updated 2 years ago
GAIR-NLP / ReasonEval
View on GitHub
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
☆80Oct 9, 2025Updated 9 months ago
hccngu / Meta-SN
View on GitHub
☆11May 23, 2023Updated 3 years ago
uwsbel / low-fidelity-dynamic-models
View on GitHub
A library of fast and accurate low fidelity dynamic models for applications in robotics
☆14Jul 12, 2024Updated 2 years ago
Scientific-Computing-Lab / MPI-rigen
View on GitHub
MPI Code Generation through Domain-Specific Language Models
☆16Nov 19, 2024Updated last year
cooelf / CompassMTL
View on GitHub
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Oct 17, 2022Updated 3 years ago
NoemieJaquier / sequencing-blending
View on GitHub
This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".
☆14Sep 11, 2022Updated 3 years ago
jumxglhf / GraphPatcher
View on GitHub
Official repository for NeurIPS'23 paper: GraphPatcher: Mitigating Degree Bias for Graph Neural Networks via Test-time Augmentation
☆17Oct 1, 2023Updated 2 years ago
ytyz1307zzh / PLUG
View on GitHub
Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"
☆13Aug 13, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wangcunxiang / Graph-aS-Tokens
View on GitHub
☆10Nov 29, 2024Updated last year
chin-gyou / controllable-selection
View on GitHub
☆14Nov 10, 2019Updated 6 years ago
M3RG-IITD / benchmarking_graph
View on GitHub
☆12Nov 30, 2022Updated 3 years ago
eth-lre / mathtutorbench
View on GitHub
Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆38Nov 18, 2025Updated 8 months ago
da03 / WildVisualizer
View on GitHub
☆28Nov 19, 2025Updated 8 months ago
DM2-ND / EDMem
View on GitHub
Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"
☆15Apr 24, 2023Updated 3 years ago
Ipuch / variational_integrator
View on GitHub
biorbd + casadi + variational integrator
☆10Jul 8, 2026Updated last week