On-device intelligence.
☆407Mar 24, 2025Updated last year
Alternatives and similar repositories for edge
Users who are interested in edge are comparing it to the libraries listed below.
- The official Cartesia client for Python.☆121Updated this week
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- The JavaScript client for the Cartesia API.☆131May 20, 2026Updated last week
- On-device semantic search over Apple WWDC 2025 docs using MLX embeddings — SwiftUI app (WWDC OMT 2025)☆76Jun 12, 2025Updated 11 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ☆27Jul 9, 2024Updated last year
- ☆14Dec 15, 2025Updated 5 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆30Feb 27, 2024Updated 2 years ago
- ☆54May 20, 2024Updated 2 years ago
- PyTorch implementation of models from the Zamba2 series.☆192Jan 23, 2025Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆21Oct 28, 2025Updated 6 months ago
- A fast multimodal LLM for real-time voice☆4,424Dec 12, 2025Updated 5 months ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- FastMLX is a high performance production ready API to host MLX models.☆357Mar 18, 2025Updated last year
- Understand and test language model architectures on synthetic tasks.☆268Mar 22, 2026Updated 2 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- ☆213Dec 11, 2024Updated last year
- train with kittens!☆66Oct 25, 2024Updated last year
- Joint speech-language model - respond directly to audio!☆372Jul 1, 2024Updated last year
- Open Source framework for voice and multimodal conversational AI☆12,468Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,439Apr 30, 2026Updated 3 weeks ago
- A basic voice agent built with Node.js agents framework☆35Oct 1, 2025Updated 7 months ago
- Joint speech-language model - respond directly to audio!☆30May 13, 2024Updated 2 years ago
- ☆345Mar 5, 2026Updated 2 months ago
- Run Time Series Foundation Models on Apple Silicon☆32Feb 27, 2026Updated 3 months ago
- Materials for the LLM Evals Workshop from Weights & Biases☆15Feb 24, 2025Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆254Jun 6, 2025Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated 2 years ago
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year
- ☆21Sep 6, 2025Updated 8 months ago
- ☆33Nov 21, 2025Updated 6 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆10,261May 16, 2026Updated last week
- first base model for full-duplex conversational audio☆1,788Jan 5, 2025Updated last year
- 🚀 Efficient implementations for emerging model architectures☆5,139Updated this week
- ☆1,395Jan 29, 2026Updated 3 months ago
- MedConceptsQA: Open source medical concepts QA benchmark☆18Dec 30, 2024Updated last year
- ☆25Aug 20, 2024Updated last year