A comprehensive benchmark to evaluate and improve the fundamental numerical reasoning abilities of large language models using diverse synthetic and real-world datasets.
☆29Jun 21, 2025Updated 9 months ago
Alternatives and similar repositories for NumericBench
Users that are interested in NumericBench are comparing it to the libraries listed below
Sorting:
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆293Dec 5, 2025Updated 3 months ago
- ☆28May 29, 2025Updated 9 months ago
- Exploring prompt tuning with pseudolabels for multiple modalities, learning settings, and training strategies.☆49Nov 8, 2024Updated last year
- [ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models☆26May 14, 2024Updated last year
- A C++ implementation of Walker's Alias Method for quickly sampling from an array with a given probability distribution☆10Mar 16, 2016Updated 10 years ago
- Awesome Few-Shot Learning on Graphs☆22Apr 27, 2025Updated 10 months ago
- ☆12Dec 8, 2022Updated 3 years ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- ☆13Jun 8, 2021Updated 4 years ago
- Materials for the LLM Evals Workshop from Weights & BIases☆14Feb 24, 2025Updated last year
- [ICML 2024] Offical code repo for ICML2024 paper "Candidate Pseudolabel Learning: Enhancing Vision-Language Models by Prompt Tuning with …☆31Jun 21, 2024Updated last year
- fully javascript usbtinyisp like adafruit usbtinyisp, mit fabisp or sparkfun tinyavr☆11Feb 3, 2019Updated 7 years ago
- Data for "Early Identification of Depression Severity Levels on Reddit Using Ordinal Classification" paper accepted at The Web Conference…☆21May 16, 2022Updated 3 years ago
- ☆13May 16, 2016Updated 9 years ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆36Jul 11, 2024Updated last year
- ☆14Oct 23, 2021Updated 4 years ago
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 3 months ago
- ☆25Jul 23, 2025Updated 7 months ago
- Code repository for the VLDB2023 paper "Zebra: When Temporal Graph Neural Networks Meet Temporal Personalized PageRank".☆11Apr 26, 2024Updated last year
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- Domain adaptation framework for segmentation via reinforcement learning.☆13Oct 13, 2025Updated 5 months ago
- A few JavaScript examples that integrate a Firebase with the physical world.☆24Sep 2, 2014Updated 11 years ago
- Use p5 to build interactions that happen in the mobile browser☆19Jan 27, 2017Updated 9 years ago
- If you are trying to find the download url of specific datasets or some books☆14Apr 18, 2020Updated 5 years ago
- ☆18Jul 31, 2023Updated 2 years ago
- ☆11Mar 25, 2024Updated last year
- ☆20Jun 12, 2025Updated 9 months ago
- The codebase and datasets for the IJCAI 2021 paper "The Surprising Power of Graph Neural Networks with Random Node Initialization".☆22Jun 3, 2021Updated 4 years ago
- Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification☆12Oct 15, 2018Updated 7 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- Official implementation of "Learning by Sorting: Self-supervised Learning with Group Ordering Constraints." ICCV 2023☆16Nov 12, 2023Updated 2 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'☆14Aug 22, 2025Updated 7 months ago
- ☆14May 28, 2025Updated 9 months ago
- Refactoring Workshop☆47Apr 20, 2022Updated 3 years ago
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆14Dec 18, 2024Updated last year
- BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks☆21Mar 7, 2024Updated 2 years ago
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.☆21Apr 11, 2025Updated 11 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 11 months ago