tum-ai / number-token-loss
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
☆28 · Aug 18, 2025 · Updated 5 months ago
Alternatives and similar repositories for number-token-loss
Users who are interested in number-token-loss compare it to the repositories listed below.
- Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations ☆52 · Updated this week
- Official implementation of "EchoTracker: Advancing Myocardial Point Tracking in Echocardiography". (MICCAI 2024) ☆56 · Nov 20, 2024 · Updated last year
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos. ☆30 · Jul 9, 2025 · Updated 7 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆51 · May 12, 2025 · Updated 9 months ago
- Platform API Project seed ☆12 · Nov 8, 2023 · Updated 2 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- Visualizing 230 years of US Census data ☆12 · Feb 23, 2020 · Updated 5 years ago
- ☆12 · Mar 15, 2023 · Updated 2 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search ☆14 · Jun 18, 2025 · Updated 7 months ago
- Test-Time Label-Shift Adaptation ☆13 · May 24, 2023 · Updated 2 years ago
- A relatively simple, unified method for reporting on Kubernetes resource issues. ☆12 · Mar 5, 2020 · Updated 5 years ago
- ☆35 · Updated this week
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri… ☆12 · Mar 10, 2024 · Updated last year
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization ☆12 · Dec 3, 2024 · Updated last year
- This repo contains documentation related to the operation of the OpenBytes project. ☆13 · Oct 29, 2021 · Updated 4 years ago
- Application for Agent re-engineering for better and reliable Gen AI workflows. ☆10 · Jul 20, 2025 · Updated 6 months ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis… ☆14 · Oct 21, 2024 · Updated last year
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization ☆20 · Jan 27, 2026 · Updated 3 weeks ago
- Domain adaptation framework for segmentation via reinforcement learning. ☆11 · Oct 13, 2025 · Updated 4 months ago
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs) ☆60 · Jul 24, 2025 · Updated 6 months ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier' ☆14 · Aug 22, 2025 · Updated 5 months ago
- ☆17 · May 3, 2025 · Updated 9 months ago
- Code for the paper "Multimodal brain age estimation using interpretable adaptive population-graph learning" ☆10 · Dec 4, 2023 · Updated 2 years ago
- A Bunyan stream to send events to Seq ☆11 · May 7, 2025 · Updated 9 months ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024) ☆14 · Jun 21, 2024 · Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs ☆23 · Sep 21, 2025 · Updated 4 months ago
- ☆12 · Jan 25, 2026 · Updated 3 weeks ago
- Parses PDF drawings using the Nova and Claude 3.7 models on Amazon Bedrock. ☆12 · May 19, 2025 · Updated 8 months ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference ☆13 · Oct 8, 2024 · Updated last year
- PyTorch implementation of Bezier simplex fitting ☆12 · Feb 9, 2026 · Updated last week
- ☆16 · Sep 4, 2025 · Updated 5 months ago
- 🇰🇷 Korean LLM Datasets | Pre-training, SFT, DPO, RLHF, CoT | Korean LLM dataset curation ☆31 · Jan 20, 2026 · Updated 3 weeks ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference ☆36 · Oct 29, 2025 · Updated 3 months ago
- ☆13 · Mar 25, 2022 · Updated 3 years ago
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int… ☆21 · Jul 4, 2025 · Updated 7 months ago
- Build a Slurm Cluster using SaltStack in virtual machines ☆12 · Nov 26, 2018 · Updated 7 years ago
- Lightweight Python wrapper around the DuckDB extension, httpserver (extension developed by @quackscience) ☆17 · Sep 24, 2025 · Updated 4 months ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders. ☆14 · Jun 7, 2023 · Updated 2 years ago
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models ☆11 · Jan 23, 2024 · Updated 2 years ago