golololologol / LLM-Distillery
A pipeline for LLM knowledge distillation
☆112 · Apr 2, 2025 · Updated 10 months ago
Alternatives and similar repositories for LLM-Distillery
Users interested in LLM-Distillery are comparing it to the libraries listed below
- An Open Source Toolkit For LLM Distillation ☆860 · Dec 21, 2025 · Updated last month
- Best practices for distilling large language models. ☆604 · Feb 1, 2024 · Updated 2 years ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Oct 9, 2024 · Updated last year
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing ☆21 · Mar 18, 2025 · Updated 10 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆250 · Mar 13, 2025 · Updated 11 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers ☆23 · Feb 9, 2025 · Updated last year
- Build Neo4J Knowledge Graphs from Excel files ☆22 · Nov 18, 2024 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 · Feb 29, 2024 · Updated last year
- This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicit… ☆1,252 · Mar 9, 2025 · Updated 11 months ago
- ☆28 · May 24, 2025 · Updated 8 months ago
- Python code implementing the algorithm designed by Mueen at UC Riverside. The description of the paper can be found in the paper - "Searc… ☆13 · Oct 13, 2014 · Updated 11 years ago
- Work with your business data using natural language ☆19 · Nov 20, 2024 · Updated last year
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that … ☆11 · Nov 20, 2024 · Updated last year
- An embedded RPC framework for smart home scenarios ☆14 · May 5, 2024 · Updated last year
- ☆48 · Aug 29, 2024 · Updated last year
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ☆12 · Dec 13, 2023 · Updated 2 years ago
- some common Huggingface transformers in maximal update parametrization (µP) ☆87 · Mar 14, 2022 · Updated 3 years ago
- ☆56 · Nov 6, 2024 · Updated last year
- Codebase for Instruction Following without Instruction Tuning ☆36 · Sep 24, 2024 · Updated last year
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Intelligent customer service quality inspection / conversation analysis ☆13 · Mar 20, 2024 · Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 6 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval ☆10 · May 30, 2024 · Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Apr 29, 2024 · Updated last year
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs? ☆19 · Mar 9, 2025 · Updated 11 months ago
- Open Human Ontology: A collaborative project to build a comprehensive, open, and structured human ontology for research and applications … ☆121 · Oct 13, 2025 · Updated 4 months ago
- Examples for QinYan GLMs ☆13 · Sep 3, 2024 · Updated last year
- Exploring Applications of GRPO ☆250 · Aug 25, 2025 · Updated 5 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆35 · Mar 7, 2025 · Updated 11 months ago
- A compact LLM pretrained in 9 days by using high quality data ☆339 · Apr 9, 2025 · Updated 10 months ago
- ☆41 · Apr 30, 2025 · Updated 9 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning ☆19 · Nov 22, 2025 · Updated 2 months ago
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter ☆22 · May 28, 2025 · Updated 8 months ago
- Official implementation for "Knowledge Distillation with Refined Logits". ☆21 · Aug 26, 2024 · Updated last year
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- ☆15 · Jun 20, 2024 · Updated last year
- ☆18 · Apr 18, 2025 · Updated 9 months ago
- Here we provide and collect many functions to generate math problems and step-by-step solutions for LLM training ☆18 · Jun 21, 2023 · Updated 2 years ago
- Training hybrid models for dummies. ☆29 · Nov 1, 2025 · Updated 3 months ago