evilsocket / cake
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,882 Updated 11 months ago
Alternatives and similar repositories for cake
Users that are interested in cake are comparing it to the libraries listed below
- Blazingly fast LLM inference. ☆6,141 Updated this week
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference. ☆2,688 Updated this week
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge ☆1,513 Updated this week
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. ☆1,754 Updated 3 months ago
- Local AI API Platform ☆2,760 Updated 3 months ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆31,743 Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,903 Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,611 Updated last month
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆8,978 Updated last week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,685 Updated this week
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆4,221 Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-o ☆4,207 Updated 5 months ago
- Deep learning at the speed of light. ☆2,550 Updated this week
- Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild ☆2,783 Updated this week
- Local realtime voice AI ☆2,370 Updated 7 months ago
- A nanoGPT pipeline packed in a spreadsheet ☆2,126 Updated last year
- Minimal LLM inference in Rust ☆1,013 Updated 11 months ago
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch… ☆5,951 Updated 5 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,325 Updated last year
- llama and other large language models on iOS and MacOS offline using GGML library. ☆1,888 Updated 3 weeks ago
- An open source AI wearable device that captures what you say and hear in the real world and then transcribes and stores it on your own se… ☆3,297 Updated last year
- AICI: Prompts as (Wasm) Programs ☆2,051 Updated 8 months ago
- A fast multimodal LLM for real-time voice ☆4,216 Updated last month
- An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework. ☆1,581 Updated last year
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 ☆1,323 Updated this week
- A MLX port of FLUX based on the Huggingface Diffusers implementation. ☆1,569 Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆3,119 Updated this week
- On-device Speech Recognition for Apple Silicon ☆5,107 Updated last week
- A vector search SQLite extension that runs anywhere! ☆6,237 Updated 8 months ago
- Convert any PDF into a podcast episode! ☆2,476 Updated 10 months ago