evilsocket / cake
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,706 Updated 2 months ago
Alternatives and similar repositories for cake:
Users who are interested in cake are comparing it to the libraries listed below.
- Blazingly fast LLM inference. ☆4,826 Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,492 Updated this week
- A framework for Claude Opus to intelligently orchestrate subagents. ☆4,185 Updated 6 months ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in… ☆1,622 Updated last week
- A simple screen parsing tool towards pure vision based GUI agent ☆5,509 Updated last week
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆4,147 Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,462 Updated this week
- A language model programming library. ☆5,556 Updated 3 weeks ago
- High-speed Large Language Model Serving on PCs with Consumer-grade GPUs ☆8,050 Updated 4 months ago
- The easiest way to use Agentic RAG in any enterprise ☆3,972 Updated 2 weeks ago
- ☆7,156 Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆18,680 Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆7,353 Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆7,430 Updated this week
- Local AI API Platform ☆2,326 Updated this week
- Open Source framework for voice and multimodal conversational AI ☆4,299 Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,908 Updated this week
- Large Action Model framework to develop AI Web Agents ☆5,807 Updated 2 months ago
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆3,443 Updated 2 weeks ago
- Open source Claude Artifacts – built with Llama 3.1 405B ☆5,083 Updated this week
- Local realtime voice AI ☆2,162 Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,746 Updated 2 months ago
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. ☆1,467 Updated this week
- A fast multimodal LLM for real-time voice ☆2,760 Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,775 Updated 3 months ago
- A vector search SQLite extension that runs anywhere! ☆4,670 Updated last week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models ☆2,642 Updated last week
- Composable building blocks to build Llama Apps ☆6,036 Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆4,966 Updated this week
- Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist … ☆10,626 Updated last month