Tree-based speculative decoding for Apple Silicon (MLX). ~10-15% faster than DFlash on code, ~1.5x over autoregressive. First MLX port with custom Metal kernels for hybrid model support.
☆133 · Apr 15, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for ddtree-mlx
Users that are interested in ddtree-mlx are comparing it to the libraries listed below.
- Run GEPA on your favorite non-python libraries. ☆35 · Jan 22, 2026 · Updated 3 months ago
- Pure Triton kernels for Qwen3.5-27B inference on NVIDIA B200 ☆108 · Feb 28, 2026 · Updated 2 months ago
- Text with Open Interpreter, running locally on your Mac. Credit: Morisy ☆23 · Oct 6, 2023 · Updated 2 years ago
- Automatically detect form fields in PDFs with CommonForms using ONNX Runtime Web ☆25 · Oct 29, 2025 · Updated 6 months ago
- ☆13 · Mar 5, 2025 · Updated last year
- Evolving durable programs ☆20 · Jul 29, 2024 · Updated last year
- ☆83 · Mar 3, 2026 · Updated 2 months ago
- Complete automated setup guide for Qwen3-Coder-480B-A35B-Instruct model installation on Ubuntu with NVIDIA GPUs ☆44 · Aug 3, 2025 · Updated 9 months ago
- Asynchronous pipeline parallel optimization ☆21 · Feb 2, 2026 · Updated 3 months ago
- Train and run transformers directly on Apple's Neural Engine in Swift, bypassing Core ML entirely ☆104 · Apr 18, 2026 · Updated 2 weeks ago
- PicWish T2I, Photo Enhancer and Background Remover for Python ☆25 · Jul 6, 2025 · Updated 10 months ago
- ☆22 · Sep 29, 2025 · Updated 7 months ago
- An opinionated MCP module for NestJS ☆11 · Apr 10, 2026 · Updated 3 weeks ago
- Samples to show you how to create and deploy apps with Defang. ☆11 · Updated this week
- tidx indexes Tempo chain data into a hybrid PostgreSQL + ClickHouse architecture for fast point lookups (OLTP) and lightning-fast analyti… ☆79 · Apr 30, 2026 · Updated last week
- On-demand MCP tool discovery for AI agents ☆25 · Updated this week
- Specification 1 (hierarchical style) for an agentic software development crew, for implementation with mainstream Agentic platforms like… ☆31 · Apr 17, 2026 · Updated 3 weeks ago
- Minimal example of how to use FastAPI and Supabase Auth. Draws reference from: https://testdriven.io/blog/fastapi-jwt-auth/ and https://w… ☆16 · Dec 12, 2022 · Updated 3 years ago
- The Open Source Voice Agent Platform. Orchestrate ultra-low latency AI pipelines for real-time conversations over WebRTC. ☆43 · Updated this week
- ☆11 · Apr 28, 2022 · Updated 4 years ago
- Lightweight, model-agnostic chat history compression (trim + summarize) for AI assistants. ☆23 · Sep 14, 2025 · Updated 7 months ago
- ☆37 · Oct 18, 2024 · Updated last year
- ☆10 · Jul 31, 2024 · Updated last year
- torchlogic is a pytorch framework for developing Neuro-Symbolic AI systems and implements Neural Reasoning Networks. ☆18 · Sep 18, 2025 · Updated 7 months ago
- Food tour planner using LangChain DeepAgents, Google Maps API, and Tavily research. Explore multi-agent coordination patterns: task dele… ☆85 · Nov 6, 2025 · Updated 6 months ago
- ☆13 · Mar 28, 2024 · Updated 2 years ago
- Dummy Data generator ☆11 · Mar 24, 2024 · Updated 2 years ago
- A collection of prompts, skills, and agent rules for AI-powered development workflows. ☆50 · Mar 11, 2026 · Updated last month
- Rust widget toolkit built on Reclutch ☆11 · Mar 25, 2020 · Updated 6 years ago
- TypeScript port of Google's Agent Development Kit (ADK): An open-source, code-first toolkit for building, evaluating, and deploying AI ag… ☆38 · Nov 4, 2025 · Updated 6 months ago
- TLS & API keys for your LLM APIs ☆20 · Dec 17, 2025 · Updated 4 months ago
- pichuang personal website ☆19 · Jun 10, 2025 · Updated 10 months ago
- ☆64 · Apr 14, 2026 · Updated 3 weeks ago
- flightcn is a flight route visualization component set built for the mapcn ecosystem ☆97 · Updated this week
- A simple dashboard application using bootstrap and python Dash framework. ☆10 · May 11, 2019 · Updated 6 years ago
- ☆13 · Jan 3, 2025 · Updated last year
- MCP Server implementation for Claude ☆27 · Dec 1, 2024 · Updated last year
- Generate informative knowledge graphs from text using open source models, ollama ☆23 · Sep 1, 2025 · Updated 8 months ago
- PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV in… ☆65 · Feb 18, 2026 · Updated 2 months ago