modelscope / easydistillLinks
a toolkit on knowledge distillation for large language models
☆110Updated last week
Alternatives and similar repositories for easydistill
Users that are interested in easydistill are comparing it to the libraries listed below
Sorting:
- ☆230Updated last year
- ☆48Updated last year
- 文本去重☆74Updated last year
- ☆97Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆66Updated 2 years ago
- code for piccolo embedding model from SenseTime☆131Updated last year
- ☆144Updated last year
- LLaMA Factory Document☆140Updated last month
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆18Updated 8 months ago
- Deep Reasoning Translation (DRT) Project☆225Updated last month
- ☆40Updated last year
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆253Updated last week
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆40Updated last year
- 怎么训练一个LLM分词器☆151Updated 2 years ago
- 大语言模型训练和服务调研☆37Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆139Updated last year
- ☆82Updated last year
- ☆280Updated last month
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 8 months ago
- 中文基于满血DeepSeek-R1蒸馏数据集☆56Updated 4 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆175Updated 2 months ago
- Light local website for displaying performances from different chat models.☆87Updated last year
- ☆154Updated 2 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆81Updated 10 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- WritingBench: A Comprehensive Benchmark for Generative Writing☆94Updated 3 weeks ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆110Updated last month
- A visuailzation tool to make deep understaning and easier debugging for RLHF training.☆228Updated 4 months ago
- ☆124Updated last year