A comprehensive benchmark to evaluate and improve the fundamental numerical reasoning abilities of large language models using diverse synthetic and real-world datasets.
☆29Jun 21, 2025Updated 11 months ago
Alternatives and similar repositories for NumericBench
Users that are interested in NumericBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- [ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models☆27May 14, 2024Updated 2 years ago
- A C++ implementation of Walker's Alias Method for quickly sampling from an array with a given probability distribution☆10Mar 16, 2016Updated 10 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICML'24 Oral] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning wi…☆32Jun 21, 2024Updated last year
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆26Jan 27, 2026Updated 3 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆37Jul 11, 2024Updated last year
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 5 months ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆17Jun 20, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- CONditionals for Ordinal Regression and classification in PyTorch☆12Nov 5, 2022Updated 3 years ago
- ☆11Mar 25, 2024Updated 2 years ago
- ☆20Jun 12, 2025Updated 11 months ago
- Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification☆12Oct 15, 2018Updated 7 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- Official implementation of "Learning by Sorting: Self-supervised Learning with Group Ordering Constraints." ICCV 2023☆16Nov 12, 2023Updated 2 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'☆14Aug 22, 2025Updated 9 months ago
- ☆13May 28, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆22Dec 18, 2024Updated last year
- BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks☆22Mar 7, 2024Updated 2 years ago
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.☆21Apr 11, 2025Updated last year
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated last year
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆21Mar 31, 2025Updated last year
- ☆10Oct 24, 2024Updated last year
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data☆20Apr 3, 2025Updated last year
- [MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality☆12Sep 26, 2025Updated 7 months ago
- Individual Coefficient Approximation for Risk Estimation (ICARE) model☆18Sep 9, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ECCV 2022] Uncertainty Quantification in Depth Estimation via Constrained Ordinal Regression☆13Mar 27, 2023Updated 3 years ago
- officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"☆14Jul 4, 2024Updated last year
- ☆15May 15, 2025Updated last year
- ☆15Jul 15, 2023Updated 2 years ago
- Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MIC…☆18Feb 12, 2025Updated last year
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…☆19Nov 4, 2025Updated 6 months ago
- [EMNLP 2024] Implementation of vision-language model fine-tuning via simple parameter-efficient modification☆19Nov 24, 2024Updated last year