Lossless DFlash speculative decoding for MLX on Apple Silicon
☆731Jun 11, 2026Updated last week
Alternatives and similar repositories for dflash-mlx
Users that are interested in dflash-mlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM training on Apple's Neural Engine — native Obj-C, private APIs, zero GPU. Dynamic weight pipeline for training without kernel recompi…☆55Mar 17, 2026Updated 3 months ago
- Pure MLX implementations of UMAP, t-SNE, PaCMAP, TriMap, DREAMS, CNE, MMAE, and NNDescent for Apple Silicon. Metal GPU for computation an…☆86Mar 20, 2026Updated 2 months ago
- ☆174Mar 30, 2026Updated 2 months ago
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆88Nov 11, 2025Updated 7 months ago
- ☆213Mar 24, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Mobile & desktop command center for AI coding agents. Control Claude, Codex, Gemini sessions across machines from your phone — secured by…☆30May 31, 2026Updated 2 weeks ago
- Code for paper "Analog Foundation Models"☆35Mar 25, 2026Updated 2 months ago
- Flash-MoE sidecar slot-bank runtime for large GGUF MoE models on Apple Silicon — llama.cpp fork☆106May 16, 2026Updated last month
- Train Large Language Models on MLX.☆377Jun 11, 2026Updated last week
- ☆41Apr 9, 2026Updated 2 months ago
- PMetal: high-performance Apple Silicon framework for local LLM inference, LoRA/QLoRA fine-tuning, serving, quantization, and MLX/Metal ac…☆296Jun 5, 2026Updated 2 weeks ago
- Autonomous AI orchestration architecture combining Google Antigravity with Jules API for hands-free development workflows. MCP integratio…☆39Apr 1, 2026Updated 2 months ago
- Stop hook for Claude Code that keeps the agent working until all plans and user requests are 100% complete☆503Mar 11, 2026Updated 3 months ago
- Run LLMs with MLX☆5,825Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Swift package for reading and writing Safetensors files.☆13Feb 6, 2026Updated 4 months ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆101Mar 6, 2026Updated 3 months ago
- an AI that lives on your computer and does stuff for you. Public beta.☆100May 3, 2026Updated last month
- Dockerized Screaming Frog SEO Spider☆13May 22, 2023Updated 3 years ago
- Collection of inspirational laws from several studies domains☆17Feb 4, 2026Updated 4 months ago
- ☆40Jan 30, 2026Updated 4 months ago
- Find Answers Based on Research Papers☆27Jun 18, 2025Updated last year
- Xilly Game Mode is a competitive-grade optimization utility designed to instantly reallocate your PC's resources for maximum gaming perfo…☆20Feb 13, 2026Updated 4 months ago
- My collection of dotfiles☆14Apr 22, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Interactive chat application that enables users to have conversations with any website's content using Groq's fast inference capabilities…☆25Sep 25, 2025Updated 8 months ago
- thinking tool for claude desktop/mcp clients using Deepseek reasoner☆56Jan 28, 2025Updated last year
- A sharp command-line tool for AI-assisted coding.☆39Jun 12, 2025Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Apr 10, 2026Updated 2 months ago
- ☆10Jun 16, 2025Updated last year
- Your TLA+ spec and your TypeScript code drift apart. This kit makes that impossible.☆104Apr 4, 2026Updated 2 months ago
- Peakflo Unified Model Context Protocol (pfMCP)☆19Updated this week
- Enemies for your LLM☆37Jan 20, 2026Updated 4 months ago
- 功能上来说就是Claude Code webUI和frp的结合体,简化配置和部署☆63Nov 7, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An experimental pi extension that runs and manages qwen with llama.cpp☆144May 11, 2026Updated last month
- 👩🚒 A lightweight framework for CloudFlare Workers☆13Jun 16, 2022Updated 4 years ago
- Lux is an open-source framework for building multi-agent, swarmed intelligence built by Spectral Labs. With Beams, Prisms, Lenses, and Si…☆19Feb 25, 2025Updated last year
- Open-source Boilerplate project This repo contains an open-source Networked multiplayer Unity game project developed by Alexandria Pagra…☆15Jun 28, 2021Updated 4 years ago
- Calls a function repeatedly while a condition returns true and then resolves the promise☆38Aug 29, 2024Updated last year
- An end-to-end voice assistant running entirely on Apple Silicon.☆63Nov 29, 2025Updated 6 months ago
- ☆11Sep 18, 2023Updated 2 years ago