modelscope / easydistillLinks

a toolkit on knowledge distillation for large language models

☆200

Alternatives and similar repositories for easydistill

Users that are interested in easydistill are comparing it to the libraries listed below

Sorting:

Chinese-Tiny-LLM / Chinese-Tiny-LLM
☆235Updated last year
the-seeds / LLaMA-Factory-Doc
LLaMA Factory Document
☆154Updated 2 weeks ago
OpenSenseNova / piccolo-embedding
code for piccolo embedding model from SenseTime
☆143Updated last year
a-m-team / a-m-models
a-m-team's exploration in large language modeling
☆192Updated 5 months ago
hengjiUSTC / learn-llm
☆115Updated last year
percent4 / llm_math_solver
本项目用于大模型数学解题能力方面的数据集合成，模型训练及评测，相关文章记录。
☆97Updated last year
HarderThenHarder / RLLoggingBoard
A visuailzation tool to make deep understaning and easier debugging for RLHF training.
☆265Updated 9 months ago
RUC-GSAI / Yulan-GARDEN
Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"
☆84Updated last year
FlagAI-Open / OpenSeek
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next…
☆239Updated last week
Alibaba-NLP / OmniSearch
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆393Updated 6 months ago
modelscope / Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…
☆404Updated this week
FlagOpen / Infinity-Instruct
☆49Updated last year
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
PKU-Baichuan-MLSystemLab / PAS
☆54Updated last year
Tongyi-Zhiwen / Qwen-Doc
☆301Updated 5 months ago
SuperGPQA / SuperGPQA
☆172Updated 6 months ago
inclusionAI / Ling
Ling is a MoE LLM provided and open-sourced by InclusionAI.
☆233Updated 6 months ago
boson-ai / RPBench-Auto
An automated pipeline for evaluating LLMs for role-playing.
☆202Updated last year
SkyworkAI / Skywork-Reward-V2
Scaling Preference Data Curation via Human-AI Synergy
☆128Updated 4 months ago
bigai-nlco / TokenSwift
[ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation
☆118Updated 6 months ago
CASIA-LM / ChineseWebText
☆180Updated 2 years ago
PALIN2018 / BrowseComp-ZH
☆127Updated 6 months ago
THUDM / LongAlign
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆256Updated 11 months ago
CLUEbenchmark / SuperCLUE-Math6
SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅
☆60Updated last year
IEIT-Yuan / Yuan2.0-M32
Mixture-of-Experts (MoE) Language Model
☆192Updated last year
multimodal-art-projection / Megatron-LM-NEO
☆40Updated last year
RUC-GSAI / YuLan-Mini
A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.
☆222Updated 3 months ago
Alibaba-NLP / MaskSearch
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆146Updated 5 months ago
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆154Updated 2 years ago
thunlp / Delta-CoMe
Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024
☆58Updated last year