UCSC-REAL / TokenCleaningLinks
[ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"
☆49Updated last month
Alternatives and similar repositories for TokenCleaning
Users that are interested in TokenCleaning are comparing it to the libraries listed below
Sorting:
- ☆65Updated 9 months ago
- ☆80Updated last month
- ☆36Updated last year
- ☆51Updated last year
- IntelliGuard: AI-powered code guardianship system. Enforces Intelligence Engineering Formatting Rules (IEFR) via iefr.json, automating qu…☆47Updated 2 weeks ago
- 强化学习-大语言模型☆65Updated last month
- ☆53Updated 2 months ago
- ☆81Updated last month
- GLT has presented the first attempt to accelerate GNN inference. Though promising, GLT encounters robustness and generalization issues wh…☆28Updated last year
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆40Updated 5 months ago
- ☆80Updated last month
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆22Updated 7 months ago
- This project performs large scale Monte Carlo simulations of the eigenvalues that appear in Johansen's null distribution. Results are wri…☆25Updated last month
- ☆61Updated last month
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆31Updated last year
- [ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"☆97Updated 4 months ago
- MGCF-Net for Phishing URLs Detection☆51Updated 2 months ago
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho…☆78Updated last year
- MCP_Server for MediaCrawler. To Use MediaCrawler conveniently☆31Updated this week
- A Knowledge Base on Pre-made Dishes☆106Updated last month
- ☆106Updated 6 months ago
- A Chatbot with UI design is created, according to some certain datasets (can be replaced). Through statistical analysis and PINN model, i…☆27Updated 2 months ago
- Enhanced Credit Card Fraud Detection Using Machine Learning☆96Updated 7 months ago
- Official Code of Logits-Based-Finetuning☆87Updated last month
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆43Updated 3 months ago
- An iterative optimization system☆34Updated last month
- `cryptor` is a Go package for secure encryption and decryption using NaCl's `secretbox` from `golang.org/x/crypto`☆62Updated last month
- ☆33Updated 3 months ago
- Please visit our demonstration website for interactive demonstrations☆31Updated 10 months ago
- ☆41Updated 2 weeks ago