On-device intelligence.
☆401 · Mar 24, 2025 · Updated last year
Alternatives and similar repositories for edge
Users who are interested in edge are comparing it to the libraries listed below.
- The official Cartesia client for Python. ☆121 · Updated this week
- Developer showcase of projects built on Cartesia ☆20 · Aug 28, 2024 · Updated last year
- Cartesia Line SDK for voice agents. ☆99 · Updated this week
- The JavaScript client for the Cartesia API. ☆131 · Updated this week
- On-device semantic search over Apple WWDC 2025 docs using MLX embeddings; SwiftUI app (WWDC OMT 2025) ☆76 · Jun 12, 2025 · Updated 10 months ago
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆15 · Apr 30, 2025 · Updated last year
- ☆27 · Jul 9, 2024 · Updated last year
- ☆19 · Dec 4, 2025 · Updated 5 months ago
- ☆13 · Dec 15, 2025 · Updated 4 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf ☆21 · Jul 29, 2024 · Updated last year
- ☆30 · Feb 27, 2024 · Updated 2 years ago
- ☆54 · May 20, 2024 · Updated last year
- PyTorch implementation of models from the Zamba2 series. ☆194 · Jan 23, 2025 · Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆21 · Oct 28, 2025 · Updated 6 months ago
- A fast multimodal LLM for real-time voice ☆4,412 · Dec 12, 2025 · Updated 4 months ago
- Grokking on modular arithmetic in less than 150 epochs in MLX ☆15 · Oct 24, 2024 · Updated last year
- FastMLX is a high performance production ready API to host MLX models. ☆352 · Mar 18, 2025 · Updated last year
- Understand and test language model architectures on synthetic tasks. ☆265 · Mar 22, 2026 · Updated last month
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated 2 years ago
- ☆212 · Dec 11, 2024 · Updated last year
- train with kittens! ☆64 · Oct 25, 2024 · Updated last year
- Joint speech-language model - respond directly to audio! ☆372 · Jul 1, 2024 · Updated last year
- Open Source framework for voice and multimodal conversational AI ☆11,687 · Updated this week
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,434 · Apr 30, 2026 · Updated last week
- A basic voice agent built with Node.js agents framework ☆35 · Oct 1, 2025 · Updated 7 months ago
- Open weights language model from Google DeepMind, based on Griffin. ☆672 · Feb 6, 2026 · Updated 3 months ago
- Joint speech-language model - respond directly to audio! ☆30 · May 13, 2024 · Updated last year
- ☆345 · Mar 5, 2026 · Updated 2 months ago
- Run Time Series Foundation Models on Apple Silicon ☆32 · Feb 27, 2026 · Updated 2 months ago
- Materials for the LLM Evals Workshop from Weights & Biases ☆15 · Feb 24, 2025 · Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆252 · Jun 6, 2025 · Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- A simple frontend page to interact with an OpenAI-like API ☆16 · Jan 31, 2025 · Updated last year
- ☆33 · Nov 21, 2025 · Updated 5 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆10,111 · Apr 28, 2026 · Updated last week
- first base model for full-duplex conversational audio ☆1,788 · Jan 5, 2025 · Updated last year
- 🚀 Efficient implementations for emerging model architectures ☆5,032 · Updated this week
- ☆1,354 · Jan 29, 2026 · Updated 3 months ago