jukofyork / transplant-vocabView external linksLinks
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆49Oct 29, 2025Updated 3 months ago
Alternatives and similar repositories for transplant-vocab
Users that are interested in transplant-vocab are comparing it to the libraries listed below
Sorting:
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 9 months ago
- Moondream MCP Server in Python☆45Jul 2, 2025Updated 7 months ago
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆20Feb 12, 2025Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12May 30, 2025Updated 8 months ago
- realtime conversational dynamics☆19Mar 19, 2025Updated 10 months ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…☆18Mar 14, 2025Updated 11 months ago
- ☆21Dec 22, 2024Updated last year
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆262Apr 23, 2024Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆30Jan 23, 2026Updated 3 weeks ago
- LLM FX: A LLM Server Desktop Client free for everyone!☆33Dec 19, 2025Updated last month
- Playing with CSM☆22Mar 14, 2025Updated 11 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 8 months ago
- notes on langchain☆18Jan 23, 2024Updated 2 years ago
- Automatically remove watermarks from illustrations using AI (Stable Diffusion).☆20Dec 17, 2024Updated last year
- A open webui function for better R1 experience☆78Mar 7, 2025Updated 11 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated 10 months ago
- ☆23Dec 9, 2025Updated 2 months ago
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Nov 5, 2025Updated 3 months ago
- ☆23Sep 27, 2024Updated last year
- ☆24Jan 22, 2025Updated last year
- Quantized text-audio foundation model from Boson AI☆43Aug 13, 2025Updated 6 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆23Apr 1, 2025Updated 10 months ago
- Web Interface for Vision Language Models Including InternVLM2☆25Jul 29, 2024Updated last year
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 9 months ago
- Trigger any command palette command via an obsidian:// uri☆27Jun 30, 2021Updated 4 years ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆633Feb 10, 2026Updated last week
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia☆45Jun 11, 2025Updated 8 months ago
- Browse, search, and visualize ONNX models.☆34May 6, 2025Updated 9 months ago
- ☆52Oct 10, 2025Updated 4 months ago
- A chess arena for large language models☆39May 22, 2025Updated 8 months ago
- Orpheus Chat WebUI☆76Mar 27, 2025Updated 10 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated 11 months ago
- ☆36Aug 10, 2025Updated 6 months ago
- Find better generation parameters for your LLM☆27Jun 9, 2024Updated last year
- A QT GUI for large language models☆39Dec 27, 2023Updated 2 years ago
- ☆54May 28, 2025Updated 8 months ago