A comprehensive benchmark to evaluate and improve the fundamental numerical reasoning abilities of large language models using diverse synthetic and real-world datasets.
☆29Jun 21, 2025Updated 10 months ago
Alternatives and similar repositories for NumericBench
Users who are interested in NumericBench are comparing it to the libraries listed below.
- Project repository of the team 拔萝卜的工程队: first place on the Huawei track leaderboard and national grand prize in the 19th "Challenge Cup" open-challenge special competition☆35Mar 18, 2026Updated last month
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- [ICML'24 Oral] Official code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning wi…☆32Jun 21, 2024Updated last year
- Official website of the Nanchang University supercomputing team☆19Aug 9, 2024Updated last year
- HUST-CS-2019 comprehensive hardware training: computer organization course project, RISC-V implementation☆16Nov 3, 2022Updated 3 years ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆24Jan 27, 2026Updated 3 months ago
- ☆13May 16, 2016Updated 9 years ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆37Jul 11, 2024Updated last year
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 4 months ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- Domain adaptation framework for segmentation via reinforcement learning.☆14Oct 13, 2025Updated 6 months ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- Bilibili helper: shows Super Chats (SC) in fullscreen and shows the IP location on comments☆24Jul 3, 2025Updated 9 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- Code for the paper "Multimodal brain age estimation using interpretable adaptive population-graph learning"☆10Dec 4, 2023Updated 2 years ago
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- ☆50Apr 4, 2026Updated 3 weeks ago
- ☆11Mar 25, 2024Updated 2 years ago
- ☆20Jun 12, 2025Updated 10 months ago
- Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification☆12Oct 15, 2018Updated 7 years ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- Official implementation of "Learning by Sorting: Self-supervised Learning with Group Ordering Constraints." ICCV 2023☆16Nov 12, 2023Updated 2 years ago
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆22Dec 18, 2024Updated last year
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.☆21Apr 11, 2025Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated last year
- The implementations for NeurIPS 2024 paper "Leveraging Tumor Heterogeneity: Heterogeneous Graph Representation Learning for Cancer Surviv…☆12Jun 11, 2025Updated 10 months ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆20Mar 31, 2025Updated last year
- A fork of the board game 无名杀 (noname), https://github.com/libccy/noname☆23Mar 28, 2024Updated 2 years ago
- ☆10Oct 24, 2024Updated last year
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data☆20Apr 3, 2025Updated last year
- [MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality☆12Sep 26, 2025Updated 7 months ago
- Individual Coefficient Approximation for Risk Estimation (ICARE) model☆18Sep 9, 2023Updated 2 years ago
- [ECCV 2022] Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression☆13Mar 27, 2023Updated 3 years ago
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- [ICLR 2026] LightMem: Lightweight and Efficient Memory-Augmented Generation☆779Apr 24, 2026Updated last week