evilsocket / cakeLinks
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,903Updated last year
Alternatives and similar repositories for cake
Users that are interested in cake are comparing it to the libraries listed below
Sorting:
- Blazingly fast LLM inference.☆6,310Updated 2 weeks ago
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,774Updated 3 weeks ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,623Updated 3 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,563Updated last week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,227Updated last month
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.☆1,792Updated 6 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,843Updated 11 months ago
- High-speed Large Language Model Serving for Local Deployment☆8,503Updated 5 months ago
- Implementation for MatMul-free LM.☆3,043Updated last month
- A lightweight library for portable low-level GPU computation using WebGPU.☆3,929Updated 2 months ago
- Local realtime voice AI☆2,411Updated last month
- Deep learning at the speed of light.☆2,663Updated this week
- Local AI API Platform☆2,763Updated 5 months ago
- Minimalist ML framework for Rust☆18,941Updated this week
- A fast llama2 decoder in pure Rust.☆1,058Updated 2 years ago
- A blazing fast inference solution for text embeddings models☆4,345Updated last week
- CoreNet: A library for training deep neural networks☆7,025Updated 2 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,327Updated last year
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,220Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,156Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,964Updated 3 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,106Updated 7 months ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆963Updated 2 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,056Updated last month
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆4,266Updated 8 months ago
- Examples in the MLX framework☆8,085Updated 2 weeks ago
- AICI: Prompts as (Wasm) Programs☆2,059Updated 11 months ago
- Agentic components of the Llama Stack APIs☆4,285Updated 4 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆7,114Updated last year
- official repository of aiXcoder-7B Code Large Language Model☆2,272Updated 5 months ago