cartesia-ai / edge
On-device intelligence.
☆307Updated last week
Alternatives and similar repositories for edge:
Users that are interested in edge are comparing it to the libraries listed below
- Fast parallel LLM inference for MLX☆174Updated 8 months ago
- Joint speech-language model - respond directly to audio!☆370Updated 8 months ago
- FastMLX is a high performance production ready API to host MLX models.☆274Updated last week
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆257Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆425Updated 6 months ago
- run paligemma in real time☆131Updated 10 months ago
- Long context evaluation for large language models☆202Updated 3 weeks ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆285Updated last week
- Run GGML models with Kubernetes.☆174Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆194Updated 10 months ago
- Multi-modal conversational AI (xRx) system☆300Updated 2 months ago
- Build your own visual reasoning model☆312Updated this week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆91Updated 2 weeks ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆224Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆105Updated 3 months ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆430Updated last month
- 🤖 Headless IDE for AI agents☆174Updated last month
- ☆106Updated last week
- Open source conversation framework and visual editor for structured Pipecat dialogues☆258Updated this week
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated 2 months ago
- procedural reasoning datasets☆534Updated this week
- PyTorch implementation of models from the Zamba2 series.☆178Updated 2 months ago
- Start a server from the MLX library.☆182Updated 8 months ago
- ☆89Updated 5 months ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆288Updated 3 weeks ago
- Claude Deep Research config for Claude Code.☆155Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX☆142Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆310Updated 3 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆299Updated 5 months ago