tum-ai / number-token-lossLinks
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
☆27Updated 4 months ago
Alternatives and similar repositories for number-token-loss
Users that are interested in number-token-loss are comparing it to the libraries listed below
Sorting:
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆48Updated 3 weeks ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated 8 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆72Updated last year
- A collection of AWESOME language modeling techniques on tabular data applications.☆32Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆67Updated 2 years ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Updated 3 months ago
- ☆146Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆56Updated 7 months ago
- ☆48Updated 10 months ago
- ☆40Updated 7 months ago
- ☆15Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- [Technical Report] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with …☆63Updated last year
- ☆19Updated 10 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆19Updated 2 years ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆41Updated 2 years ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆97Updated last year
- ☆50Updated 11 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆34Updated 2 years ago
- Unofficial Implementation of Selective Attention Transformer☆19Updated last year
- ☆82Updated last month
- ☆16Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆19Updated 9 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆20Updated 7 months ago
- ☆17Updated 5 months ago
- ☆25Updated last year
- Exploration of automated dataset selection approaches at large scales.☆53Updated 10 months ago
- Control LLM☆22Updated 9 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆57Updated 11 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year