TreeAI-Lab/NumericBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TreeAI-Lab/NumericBench)

TreeAI-Lab / NumericBench

A comprehensive benchmark to evaluate and improve the fundamental numerical reasoning abilities of large language models using diverse synthetic and real-world datasets.

☆29

Alternatives and similar repositories for NumericBench

Users that are interested in NumericBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JOHNNY-fans / MedOdyssey
View on GitHub
☆28May 29, 2025Updated last year
abhijangda / nextdoor-experiments
View on GitHub
☆12Dec 8, 2022Updated 3 years ago
liuchengwucn / Safe
View on GitHub
(ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…
☆21Dec 26, 2025Updated 6 months ago
66RING / CritiPrefill
View on GitHub
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17Sep 15, 2024Updated last year
mitchellh / omniconfig
View on GitHub
Flexible configuration for your Ruby applications and libraries.
☆16Apr 12, 2012Updated 14 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
polinapolina / temporal-pagerank
View on GitHub
☆13May 16, 2016Updated 10 years ago
MileBench / MileBench
View on GitHub
This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"
☆38Jul 11, 2024Updated 2 years ago
uncbiag / UniLMMV
View on GitHub
☆11Mar 25, 2024Updated 2 years ago
Guerbet-AI / wsp-contrastive
View on GitHub
Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023
☆11Dec 16, 2025Updated 7 months ago
BU-DiSC / pvldb-pdfa-resources
View on GitHub
☆19Jul 31, 2023Updated 2 years ago
rvenet / RVENet
View on GitHub
Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…
☆12Mar 10, 2024Updated 2 years ago
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
JiaxinZhuang / Deep-Learning
View on GitHub
If you are trying to find the download url of specific datasets or some books
☆14Apr 18, 2020Updated 6 years ago
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kAIto47802 / Prover-Agent
View on GitHub
Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs
☆28Nov 1, 2025Updated 8 months ago
needylove / PH-Reg
View on GitHub
The code of "Deep Regression Representation Learning with Topology" in ICML 2024
☆14Jul 4, 2024Updated 2 years ago
Episoode / Double-Bench
View on GitHub
[AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?
☆31Dec 14, 2025Updated 7 months ago
bintsi / adaptive-graph-learning
View on GitHub
Code for the paper "Multimodal brain age estimation using interpretable adaptive population-graph learning"
☆10Dec 4, 2023Updated 2 years ago
GarrettJenkinson / condor_pytorch
View on GitHub
CONditionals for Ordinal Regression and classification in PyTorch
☆12Nov 5, 2022Updated 3 years ago
AlexIoannides / llm-regression
View on GitHub
Exploring the classical regression capabilities of LLMs.
☆18May 20, 2024Updated 2 years ago
wjx-error / ProtoSurv
View on GitHub
The implementations for NeurIPS 2024 paper "Leveraging Tumor Heterogeneity: Heterogeneous Graph Representation Learning for Cancer Surviv…
☆15Jun 11, 2025Updated last year
spcl / smat
View on GitHub
Code for High Performance Unstructured SpMM Computation Using Tensor Cores
☆35Nov 3, 2024Updated last year
deeplearning-wisc / NSCL
View on GitHub
Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"
☆14Jun 24, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dawnnao / APESS2018_Steel_Girder_Crack_ID_dataset
View on GitHub
Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification
☆11Oct 15, 2018Updated 7 years ago
hectorcarrion / FEDD
View on GitHub
Data & Code for FEDD published @ MICCAI 23
☆12Oct 11, 2023Updated 2 years ago
AIPMLab / FACMIC
View on GitHub
Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)
☆14Jun 21, 2024Updated 2 years ago
ninatu / learning_by_sorting
View on GitHub
Official implementation of "Learning by Sorting: Self-supervised Learning with Group Ordering Constraints." ICCV 2023
☆16Nov 12, 2023Updated 2 years ago
pm25 / Semi-Supervised-Regression
View on GitHub
[NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'
☆14Aug 22, 2025Updated 11 months ago
BCV-Uniandes / SpaRED
View on GitHub
☆13May 28, 2025Updated last year
tcoyze / stochastic-blockmodel
View on GitHub
Stochastic Block Models - generate, detect, and recover
☆22Aug 14, 2016Updated 9 years ago
asherliu / tensortao
View on GitHub
Fastest software for special tensor operations
☆21Nov 23, 2023Updated 2 years ago
zhiyiscs / MoRA
View on GitHub
[MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality
☆14Sep 26, 2025Updated 9 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
robinyjpark / AutoLabelClassifier
View on GitHub
☆10Oct 24, 2024Updated last year
GradiusTwinbee / GLIS
View on GitHub
officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"
☆14Jul 4, 2024Updated 2 years ago
timmy11hu / ConOR
View on GitHub
[ECCV 2022] Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression
☆13Mar 27, 2023Updated 3 years ago
BorealisAI / ConR
View on GitHub
Contrastive Regularizer
☆15Feb 15, 2024Updated 2 years ago
XYPB / CLEFT
View on GitHub
Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…
☆18Feb 12, 2025Updated last year
yhygao / Explicd
View on GitHub
☆18Sep 19, 2024Updated last year
MrGiovanni / OnlineLearning
View on GitHub
[MICCAI 2024] Embracing Massive Medical Data
☆21Jul 5, 2024Updated 2 years ago