On-device intelligence.
☆399Mar 24, 2025Updated last year
Alternatives and similar repositories for edge
Users that are interested in edge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official Cartesia client for Python.☆122Mar 26, 2026Updated 2 weeks ago
- The JavaScript client for the Cartesia API.☆129Updated this week
- On-device semantic search over Apple WWDC 2025 docs using MLX embeddings — SwiftUI app (WWDC OMT 2025)☆76Jun 12, 2025Updated 10 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Jul 9, 2024Updated last year
- ☆19Dec 4, 2025Updated 4 months ago
- ☆13Dec 15, 2025Updated 3 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆30Feb 27, 2024Updated 2 years ago
- ☆54May 20, 2024Updated last year
- PyTorch implementation of models from the Zamba2 series.☆193Jan 23, 2025Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Oct 28, 2025Updated 5 months ago
- A fast multimodal LLM for real-time voice☆4,396Dec 12, 2025Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆352Mar 18, 2025Updated last year
- Understand and test language model architectures on synthetic tasks.☆265Mar 22, 2026Updated 3 weeks ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- ☆211Dec 11, 2024Updated last year
- train with kittens!☆64Oct 25, 2024Updated last year
- Joint speech-language model - respond directly to audio!☆373Jul 1, 2024Updated last year
- Open Source framework for voice and multimodal conversational AI☆11,217Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,421Apr 21, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A basic voice agent built with Node.js agents framework☆35Oct 1, 2025Updated 6 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆667Feb 6, 2026Updated 2 months ago
- ☆342Mar 5, 2026Updated last month
- Joint speech-language model - respond directly to audio!☆30May 13, 2024Updated last year
- Run Time Series Foundation Models on Apple Silicon☆32Feb 27, 2026Updated last month
- Materials for the LLM Evals Workshop from Weights & BIases☆15Feb 24, 2025Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆251Jun 6, 2025Updated 10 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆33Nov 21, 2025Updated 4 months ago
- ☆20Sep 6, 2025Updated 7 months ago