evilsocket / cakeLinks
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
☆2,901Updated last year
Alternatives and similar repositories for cake
Users that are interested in cake are comparing it to the libraries listed below
Sorting:
- Fast, flexible LLM inference☆6,449Updated last week
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,815Updated 2 weeks ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,588Updated this week
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.☆1,803Updated last month
- A vector search SQLite extension that runs anywhere!☆6,787Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 4 months ago
- Local AI API Platform☆2,762Updated 7 months ago
- Local realtime voice AI☆2,425Updated 2 months ago
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙 Alternative to projects like llm-d, Docker Model R…☆1,447Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,154Updated this week
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆993Updated last month
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆9,557Updated 2 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,213Updated last month
- llama and other large language models on iOS and MacOS offline using GGML library.☆1,968Updated last week
- Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models☆2,852Updated last year
- Minimal LLM inference in Rust☆1,029Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,635Updated 2 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,128Updated 2 months ago
- Making the community's best AI chat models available to everyone.☆1,980Updated last year
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,964Updated last month
- A language model programming library.☆5,878Updated 8 months ago
- Deep learning at the speed of light.☆2,766Updated this week
- AICI: Prompts as (Wasm) Programs☆2,062Updated last year
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,108Updated this week
- A fast llama2 decoder in pure Rust.☆1,059Updated 2 years ago
- A self-organizing file system with llama 3☆5,705Updated 6 months ago
- Implementation for MatMul-free LM.☆3,052Updated 2 months ago
- official repository of aiXcoder-7B Code Large Language Model☆2,273Updated 6 months ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,794Updated last month
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,760Updated 7 months ago