Easy to use, High Performant Knowledge Distillation for LLMs
☆97May 5, 2025Updated last year
Alternatives and similar repositories for distillKitPlus
Users that are interested in distillKitPlus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- An Open Source Toolkit For LLM Distillation☆942May 12, 2026Updated last week
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- ☆16Dec 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Best practices for distilling large language models.☆626Feb 1, 2024Updated 2 years ago
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆44Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- synthetic dataset generation workflow using local file resources for finetuning llms.☆83Oct 9, 2025Updated 7 months ago
- A minimal CLI tool for piping anything into an LLM.☆21Jan 1, 2026Updated 4 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆57Feb 10, 2025Updated last year
- ☆79Feb 18, 2026Updated 3 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆107Oct 31, 2024Updated last year
- A pipeline for LLM knowledge distillation☆114May 7, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Genertaes control vectors for use with llama.cpp in GGUF format.☆41Mar 19, 2025Updated last year
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS)☆17Mar 23, 2026Updated 2 months ago
- ☆12Dec 28, 2021Updated 4 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- A collection of optimized ComfyUI-based cloud inference endpoints, built on ComfyDeploy and Modal☆16Nov 5, 2024Updated last year
- Live-editable codeblocks for any language.☆16Dec 13, 2025Updated 5 months ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 10 months ago
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆65Updated this week
- ☆18Apr 23, 2025Updated last year
- Because it's there.☆16Sep 22, 2024Updated last year
- ☆49Mar 9, 2025Updated last year
- A guidance language for controlling large language models.☆43Jun 9, 2023Updated 2 years ago
- Create text chunks which end at natural stopping points without using a tokenizer☆26Nov 26, 2025Updated 5 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆52Oct 29, 2025Updated 6 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Nov 13, 2023Updated 2 years ago
- Documentation at☆14Mar 27, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- Well documented examples of running distributed training jobs on Modal☆28Updated this week
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆23Nov 26, 2023Updated 2 years ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆13Mar 19, 2024Updated 2 years ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆136Mar 4, 2026Updated 2 months ago
- Repository for experiments with several approaches to fine-tune model, pretrained on the CLIP: https://openai.com/blog/clip/☆15May 28, 2021Updated 4 years ago
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago