tum-ai / number-token-lossView external linksLinks
A regression-alike loss to improve numerical reasoning in language models - ICML 2025
☆28Aug 18, 2025Updated 5 months ago
Alternatives and similar repositories for number-token-loss
Users that are interested in number-token-loss are comparing it to the libraries listed below
Sorting:
- Official Code for Rectified LpJEPA: Joint-Embedding Predictive Architectures with Sparse and Maximum-Entropy Representations☆52Updated this week
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆30Jul 9, 2025Updated 7 months ago
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖☆10Apr 17, 2025Updated 10 months ago
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆51May 12, 2025Updated 9 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- This repo contains documentation related to the operation of the OpenBytes project.☆13Oct 29, 2021Updated 4 years ago
- Application for Agent re-engineering for better and reliable Gen AI workflows.☆10Jul 20, 2025Updated 6 months ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated last year
- 기획자와 마케터를 위한 이벤트 댓글 분석 - feat. 인프런 새해 다짐 이벤트☆11Apr 22, 2020Updated 5 years ago
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 2 years ago
- [ICCV2025] Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning☆23Nov 13, 2025Updated 3 months ago
- Domain adaptation framework for segmentation via reinforcement learning.☆11Oct 13, 2025Updated 4 months ago
- ☆30Dec 23, 2025Updated last month
- Implementation of the BatchTopK activation function for training sparse autoencoders (SAEs)☆60Jul 24, 2025Updated 6 months ago
- A tool to explore ideas generated from artificial intelligence chats.☆10Apr 3, 2023Updated 2 years ago
- >>PhysWikiQuiz<< - a Physics Question Generation and Interrogation System☆11Feb 25, 2023Updated 2 years ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int…☆21Jul 4, 2025Updated 7 months ago
- This repository contains the source code for the cloud.gov.au website.☆12Dec 7, 2022Updated 3 years ago
- [2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트☆14Jun 11, 2022Updated 3 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- Open-source intelligence (OSINT)☆15Mar 1, 2024Updated last year
- A scalable data preprocessing framework built on PySpark for LLM training☆21Dec 9, 2025Updated 2 months ago
- ☆15Feb 28, 2023Updated 2 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- ☆10Nov 1, 2019Updated 6 years ago
- [NeurIPS 2024] Official code for the paper 'RankUp: Boosting Semi-Supervised Regression with an Auxiliary Ranking Classifier'☆14Aug 22, 2025Updated 5 months ago
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 2 months ago
- ☆12Feb 9, 2022Updated 4 years ago
- Code for the paper "Multimodal brain age estimation using interpretable adaptive population-graph learning"☆10Dec 4, 2023Updated 2 years ago
- ☆20Jun 12, 2025Updated 8 months ago
- ☆16Sep 4, 2025Updated 5 months ago
- [Findings of ACL-2023] This is the official implementation of On the Difference of BERT-style and CLIP-style Text Encoders.☆14Jun 7, 2023Updated 2 years ago
- Data set of APESS2018 for one of the final projects - Steel Girder Crack Identification☆12Oct 15, 2018Updated 7 years ago
- ☆13Mar 25, 2022Updated 3 years ago
- COVID-19 corpus with annotated biomedical entities.☆11Jun 2, 2021Updated 4 years ago