modelscope / easydistill
a toolkit on knowledge distillation for large language models
☆218Updated last month
Alternatives and similar repositories for easydistill
Users who are interested in easydistill are comparing it to the libraries listed below.
- ☆235Updated last year
- a-m-team's exploration in large language modeling☆194Updated 6 months ago
- ☆300Updated 6 months ago
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆67Updated 2 years ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆432Updated this week
- ☆50Updated last year
- LLaMA Factory Document☆159Updated last week
- ☆173Updated 7 months ago
- Ling is a MoE LLM provided and open-sourced by InclusionAI.☆236Updated 6 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆272Updated 9 months ago
- Scaling Preference Data Curation via Human-AI Synergy☆132Updated 5 months ago
- ☆115Updated last year
- An automated pipeline for evaluating LLMs for role-playing.☆201Updated last year
- WritingBench: A Comprehensive Benchmark for Generative Writing☆143Updated last week
- Collect every awesome work about r1!☆425Updated 7 months ago
- ☆181Updated 2 years ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆118Updated 6 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆257Updated 11 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆73Updated 2 years ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆84Updated last year
- ☆54Updated last year
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆221Updated 4 months ago
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- code for piccolo embedding model from SenseTime☆143Updated last year
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆253Updated last year
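- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆97Updated last year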
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 6 months ago
- How to train an LLM tokenizer☆154Updated 2 years ago
- ☆92Updated 6 months ago
- ☆107Updated 3 weeks ago