Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆52Oct 29, 2025Updated 6 months ago
Alternatives and similar repositories for transplant-vocab
Users that are interested in transplant-vocab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 7 months ago
- ☆72Mar 23, 2026Updated last month
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025☆25Mar 11, 2026Updated last month
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- A minimal CLI tool for piping anything into an LLM.☆21Jan 1, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia☆46Jun 11, 2025Updated 10 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆268Apr 23, 2024Updated 2 years ago
- ☆19Jul 4, 2025Updated 9 months ago
- A open webui function for better R1 experience☆77Mar 7, 2025Updated last year
- Moondream MCP Server in Python☆46Jul 2, 2025Updated 9 months ago
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 11 months ago
- Surgically de-slop LLMs☆14Jun 1, 2025Updated 10 months ago
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆809Updated this week
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Nov 5, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- German "Who Wants To Be A Millionaire" LLM Benchmarking.☆50Apr 14, 2026Updated 2 weeks ago
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas…☆19Mar 14, 2025Updated last year
- PyTorch code of "Training a Vision Transformer from scratch in less than 24 hours with 1 GPU" (HiTY workshop at Neurips 2022)☆27Aug 30, 2023Updated 2 years ago
- realtime conversational dynamics☆19Mar 19, 2025Updated last year
- Let's have some retro gaming fun with AI! Join the discord: https://discord.gg/5xXzkMu8Zk☆78Nov 19, 2025Updated 5 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆2,194Updated this week
- Qwen LLM in the mac menu bar <3☆27Mar 12, 2025Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- ☆10Dec 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆23Sep 27, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Automatically remove watermarks from illustrations using AI (Stable Diffusion).☆21Dec 17, 2024Updated last year
- ☆24Jan 22, 2025Updated last year
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input a target size and the toolchain w…☆120Updated this week
- Port of GGML to C#☆13Jul 1, 2023Updated 2 years ago
- ☆21Dec 22, 2024Updated last year
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals.☆24Mar 26, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆53Jan 18, 2024Updated 2 years ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- flow-merge is a powerful Python library that enables seamless merging of multiple transformer-based language models using the most popula…☆20Feb 12, 2025Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- semantic search for your local files find by meaning, not keywords. 120+ file types, OCR, MCP server for AI agents. 100% private.☆60Feb 19, 2026Updated 2 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago