LLAMA Turboquant implementation with CUDA support
☆647Jun 4, 2026Updated this week
Alternatives and similar repositories for buun-llama-cpp
Users that are interested in buun-llama-cpp are comparing it to the libraries listed below.
- Run AI agents on real and isolated machines — own kernel, filesystem, and network — with <200ms boot. Local first, OCI compatible, pure R…☆125May 31, 2026Updated last week
- MCP Server to make searching openrouter easy☆21Feb 28, 2026Updated 3 months ago
- Mirror from gitlab☆11Jan 9, 2021Updated 5 years ago
- A Python tool for crawling and processing TradingView's PineScript V6 documentation. Built with Crawl4Ai framework, it extracts, cleans, …☆21Feb 22, 2026Updated 3 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆108Apr 29, 2024Updated 2 years ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆31Nov 29, 2025Updated 6 months ago
- ☆10Nov 5, 2018Updated 7 years ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆61Feb 24, 2026Updated 3 months ago
- Pure C implementation of Voxtral-4B-TTS-2603☆103Mar 27, 2026Updated 2 months ago
- [ICLR 25 Spotlight] A testbed for agents and environments that can automatically improve models through data generation.☆28Mar 4, 2025Updated last year
- Convert a bitmap font in BDF, PCF, or SFD format to an OpenType Bitmap font using FontForge's API and bdfreader☆13Feb 10, 2024Updated 2 years ago
- llama.cpp fork with additional SOTA quants and improved performance☆22Updated this week
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆33Dec 29, 2025Updated 5 months ago
- Create single file standalone Python scripts with builtin frozen file system☆18Apr 9, 2022Updated 4 years ago
- Blaze☆17Jun 19, 2021Updated 4 years ago
- an alsa-plugin that utilizes LADSPA☆14Dec 18, 2025Updated 5 months ago
- Converts n64 save files to all available types of n64 save formats☆10Jan 9, 2023Updated 3 years ago
- Minimal C implementation of the Growcut area selection algorithm☆15Apr 4, 2020Updated 6 years ago
- ☆19Jul 6, 2015Updated 10 years ago
- Dashboard v5 Coming Soon!!☆64Feb 15, 2026Updated 3 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated last month
- ☆30Apr 29, 2026Updated last month
- Lego for GRPO☆30May 27, 2025Updated last year
- ☆15Apr 26, 2025Updated last year
- ☆11Aug 23, 2025Updated 9 months ago
- Fully typesafe nextjs-ai-starter using 2 agents/tools out of the box https://nextjs.ai☆12Jun 14, 2023Updated 2 years ago
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated 2 years ago
- SyzgyDB: An embeddable vector database in Go for efficient disk-based storage and similarity searches, supporting various distance metric…☆11Nov 1, 2024Updated last year
- This includes 2 separate tutorial series for OpenAI swarm library each 10 files from basic to advanced☆14Jan 14, 2025Updated last year
- Tidymodels for Nested/Panel Data☆13Sep 30, 2023Updated 2 years ago
- Tribal Trouble with (8) Player support on a Huge Island.☆14Jan 26, 2020Updated 6 years ago
- 🚀 100% local RAG system with one-command setup. Your data never leaves your server. AGPL-3.0☆49May 27, 2026Updated last week
- High throughput streaming of Protobuf data from Kafka into DuckDB☆12Mar 4, 2026Updated 3 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Jul 8, 2024Updated last year
- A way to detect DBI frameworks, Debuggers and VMs.☆24Nov 17, 2020Updated 5 years ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆82Oct 16, 2024Updated last year
- ☆14Mar 5, 2024Updated 2 years ago
- A browser-based framework designed to aid protocol analysis☆13Sep 21, 2017Updated 8 years ago
- Persistent project knowledge graph for coding agents. MCP server with semantic search, in-process embeddings, and web explorer.☆172May 3, 2026Updated last month