A comprehensive benchmark to evaluate and improve the fundamental numerical reasoning abilities of large language models using diverse synthetic and real-world datasets.
☆29Jun 21, 2025Updated 11 months ago
Alternatives and similar repositories for NumericBench
Users that are interested in NumericBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆331Dec 5, 2025Updated 6 months ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 5 months ago
- ☆28May 29, 2025Updated last year
- [ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models☆27May 14, 2024Updated 2 years ago
- A C++ implementation of Walker's Alias Method for quickly sampling from an array with a given probability distribution☆10Mar 16, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Dec 8, 2022Updated 3 years ago
- Awesome Few-Shot Learning on Graphs☆25Apr 27, 2025Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- ☆13Jun 8, 2021Updated 5 years ago
- [ICML'24 Oral] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning wi…☆32Jun 21, 2024Updated last year
- Search Self-Play: Pushing the Frontier of Agent Capability without Supervision☆100Mar 4, 2026Updated 3 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆37Jul 11, 2024Updated last year
- ☆26Jan 14, 2017Updated 9 years ago
- ☆14Oct 23, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 5 months ago
- Code repository for the VLDB2023 paper "Zebra: When Temporal Graph Neural Networks Meet Temporal Personalized PageRank".☆11Apr 26, 2024Updated 2 years ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- ☆78Apr 15, 2026Updated last month
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- Domain adaptation framework for segmentation via reinforcement learning.☆15Oct 13, 2025Updated 7 months ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- If you are trying to find the download url of specific datasets or some books☆14Apr 18, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper "Multimodal brain age estimation using interpretable adaptive population-graph learning"☆10Dec 4, 2023Updated 2 years ago
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- ☆18Jul 31, 2023Updated 2 years ago
- The code of "Deep Regression Representation Learning with Topology" in ICML 2024☆14Jul 4, 2024Updated last year
- ☆58Apr 4, 2026Updated 2 months ago
- ☆11Mar 25, 2024Updated 2 years ago
- ☆20Jun 12, 2025Updated 11 months ago
- The codebase and datasets for the IJCAI 2021 paper "The Surprising Power of Graph Neural Networks with Random Node Initialization".☆22Jun 3, 2021Updated 5 years ago
- Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification☆11Oct 15, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- Official implementation of "Learning by Sorting: Self-supervised Learning with Group Ordering Constraints." ICCV 2023☆16Nov 12, 2023Updated 2 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'☆14Aug 22, 2025Updated 9 months ago
- ☆13May 28, 2025Updated last year
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆22Dec 18, 2024Updated last year
- BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks☆23Mar 7, 2024Updated 2 years ago
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.☆21Apr 11, 2025Updated last year