evilsocket / cakeLinks
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,882Updated 11 months ago
Alternatives and similar repositories for cake
Users that are interested in cake are comparing it to the libraries listed below
Sorting:
- Blazingly fast LLM inference.☆6,105Updated this week
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,639Updated last week
- Local AI API Platform☆2,762Updated 2 months ago
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.☆1,754Updated 3 months ago
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,825Updated 8 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,502Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,609Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,953Updated 4 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,226Updated 4 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,566Updated this week
- A self-organizing file system with llama 3☆5,633Updated last month
- Local realtime voice AI☆2,363Updated 6 months ago
- AirLLM 70B inference with single 4GB GPU☆5,920Updated 2 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,221Updated last week
- Implementation for MatMul-free LM.☆3,032Updated 2 months ago
- A lightweight library for portable low-level GPU computation using WebGPU.☆3,905Updated 2 months ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆2,878Updated 2 weeks ago
- Bionic is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality☆2,248Updated last month
- A vector search SQLite extension that runs anywhere!☆6,143Updated 7 months ago
- A fast multimodal LLM for real-time voice☆4,198Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,118Updated this week
- Llama-3 agents that can browse the web by following instructions and talking to you☆1,411Updated 9 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,323Updated last year
- Fully private LLM chatbot that runs entirely with a browser with no server needed. Supports Mistral and LLama 3.☆2,668Updated last year
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆8,917Updated last week
- OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophist…☆1,679Updated last year
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆890Updated 3 weeks ago
- Turn any glasses into AI-powered smart glasses☆3,758Updated 2 months ago
- Making the community's best AI chat models available to everyone.☆1,982Updated 7 months ago
- AICI: Prompts as (Wasm) Programs☆2,051Updated 8 months ago