Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆49 · Oct 29, 2025 · Updated 4 months ago
Alternatives and similar repositories for transplant-vocab
Users who are interested in transplant-vocab compare it to the libraries listed below
- Moondream MCP Server in Python ☆44 · Jul 2, 2025 · Updated 8 months ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from Google's paper `On the Theoretica… ☆15 · Sep 4, 2025 · Updated 6 months ago
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula… ☆20 · Feb 12, 2025 · Updated last year
- A simple library for working with Hugging Face models. ☆14 · Dec 30, 2024 · Updated last year
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM ☆12 · May 30, 2025 · Updated 9 months ago
- Semantic search for your local files: find by meaning, not keywords. 120+ file types, OCR, MCP server for AI agents. 100% private. ☆54 · Feb 19, 2026 · Updated 2 weeks ago
- A set of scripts to easily convert all training data from Hugging Face into Alpaca instruct or ShareGPT format, which should allow for eas… ☆18 · Mar 14, 2025 · Updated 11 months ago
- realtime conversational dynamics ☆19 · Mar 19, 2025 · Updated 11 months ago
- ☆21 · Dec 22, 2024 · Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆262 · Apr 23, 2024 · Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF ☆30 · Jan 23, 2026 · Updated last month
- Playing with CSM ☆22 · Mar 14, 2025 · Updated 11 months ago
- Chatbot-to-speech using Orpheus TTS model. Interactive console app. ☆21 · May 1, 2025 · Updated 10 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local. ☆17 · May 21, 2025 · Updated 9 months ago
- notes on langchain ☆18 · Jan 23, 2024 · Updated 2 years ago
- LLM FX: An LLM Server Desktop Client free for everyone! ☆37 · Updated this week
- Automatically remove watermarks from illustrations using AI (Stable Diffusion). ☆20 · Dec 17, 2024 · Updated last year
- An Open WebUI function for a better R1 experience ☆78 · Mar 7, 2025 · Updated last year
- Low-Rank Llama Custom Training ☆23 · Mar 27, 2024 · Updated last year
- ☆23 · Updated this week
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs. ☆107 · Nov 5, 2025 · Updated 4 months ago
- Quantized text-audio foundation model from Boson AI ☆43 · Aug 13, 2025 · Updated 6 months ago
- ☆23 · Sep 27, 2024 · Updated last year
- Run Orpheus 3B Locally with Gradio UI, Standalone App ☆23 · Apr 1, 2025 · Updated 11 months ago
- ☆24 · Jan 22, 2025 · Updated last year
- Trigger any command palette command via an obsidian:// uri ☆27 · Jun 30, 2021 · Updated 4 years ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypotheses + Titles + Abstracts ☆30 · Apr 30, 2025 · Updated 10 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs ☆638 · Updated this week
- OpenPipe Reinforcement Learning Experiments ☆32 · Mar 14, 2025 · Updated 11 months ago
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia ☆45 · Jun 11, 2025 · Updated 8 months ago
- ☆53 · Oct 10, 2025 · Updated 5 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆35 · Feb 11, 2026 · Updated 3 weeks ago
- Orpheus Chat WebUI ☆75 · Mar 27, 2025 · Updated 11 months ago
- A QT GUI for large language models ☆40 · Dec 27, 2023 · Updated 2 years ago
- Find better generation parameters for your LLM ☆27 · Jun 9, 2024 · Updated last year
- ☆54 · May 28, 2025 · Updated 9 months ago
- ☆35 · May 9, 2024 · Updated last year
- ☆36 · Aug 10, 2025 · Updated 6 months ago
- Lightweight continuous batching with OpenAI compatibility using Hugging Face Transformers, including T5 and Whisper. ☆29 · Mar 15, 2025 · Updated 11 months ago