tum-ai / number-token-loss
A regression-like loss to improve numerical reasoning in language models
☆17 Updated 3 weeks ago
Alternatives and similar repositories for number-token-loss
Users who are interested in number-token-loss compare it to the libraries listed below
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆19 Updated last month
- Recycling diverse models ☆44 Updated 2 years ago
- ☆13 Updated 2 years ago
- Unofficial Implementation of Selective Attention Transformer ☆17 Updated 7 months ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 3 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling ☆34 Updated last year
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits" ☆13 Updated 8 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆29 Updated 7 months ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts" ☆19 Updated 2 years ago
- ☆19 Updated 4 months ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh… ☆11 Updated 2 years ago
- ☆32 Updated 5 months ago
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆20 Updated 4 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models" ☆12 Updated last year
- ☆27 Updated last year
- Official implementation for Sparse MetA-Tuning (SMAT) ☆16 Updated last week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆40 Updated 8 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆46 Updated last month
- Unofficial implementation of Conformal Language Modeling by Quach et al ☆28 Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆19 Updated 3 months ago
- Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (h… ☆14 Updated 9 months ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation" ☆24 Updated 2 months ago
- Code for T-MARS data filtering ☆35 Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆64 Updated 9 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning ☆31 Updated last year
- ☆28 Updated 3 months ago
- ☆20 Updated 7 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆24 Updated 3 months ago
- Control LLM ☆16 Updated 2 months ago