UCSC-REAL / TokenCleaningLinks
[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"
☆46Updated 5 months ago
Alternatives and similar repositories for TokenCleaning
Users that are interested in TokenCleaning are comparing it to the libraries listed below
Sorting:
- Code and datasets for the paper "Bridging Neural and Symbolic Representations with Transitional Dictionary Learning".☆46Updated last year
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- ☆81Updated 3 weeks ago
- Official Code of Logits-Based-Finetuning☆91Updated 5 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 9 months ago
- 强化学习-大语言模型☆68Updated 5 months ago
- Our imbalance-aware ViT model achieves 0.91035 accuracy on the public leaderboard and 0.87750 on the private leaderboard of the ML2022Spr…☆26Updated 6 months ago
- Enhanced Credit Card Fraud Detection Using Machine Learning☆97Updated 11 months ago
- ☆76Updated 5 months ago
- This is the official repo for paper "StableMotion: Repurposing Diffusion-Based Image Priors for Motion Estimation"☆44Updated 4 months ago
- ☆62Updated last year
- ☆80Updated 6 months ago
- ☆49Updated last year
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆75Updated last year
- A Knowledge Base on Pre-made Dishes☆105Updated 5 months ago
- ☆51Updated 6 months ago
- ☆40Updated last year
- AnomalyControl: Learning Cross-modal Semantic Features for Controllable Anomaly Synthesis☆41Updated 4 months ago
- Neobanker FinTalk-AI: A Grounded Orchestration Framework for Multi-Agent Collaboration on Financial Tasks Leveraging the OSWorld Environm…☆39Updated 4 months ago
- By converting single-channel grayscale images into multi-channel images through various data enhancement techniques, SimOTM enhances the …☆30Updated 6 months ago
- `cryptor` is a Go package for secure encryption and decryption using NaCl's `secretbox` from `golang.org/x/crypto`☆60Updated 6 months ago
- Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…☆41Updated 6 months ago
- ☆40Updated 8 months ago
- ☆41Updated 6 months ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆72Updated 3 weeks ago
- Concise Evaluation Benchmark for Large Language Models☆25Updated 4 months ago
- ☆36Updated last year
- Source code for the paper: "Turning Dynamic Time Warping into Interpretable Recurrent Neural Network"☆33Updated 7 months ago
- QRec is an algorithm that helps you quickly find the largest fixed-aspect, axis-aligned rectangle that can be inscribed in any given poly…☆27Updated 5 months ago
- This repository documents my learning journey on the Kaggle platform, including model study notes and implementations of models actually …☆33Updated 2 weeks ago