RWKV-APP / RWKV_APPLinks
A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference.
☆33Updated this week
Alternatives and similar repositories for RWKV_APP
Users that are interested in RWKV_APP are comparing it to the libraries listed below
Sorting:
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆17Updated 2 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 9 months ago
- The DPAB-α Benchmark☆30Updated 9 months ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included.☆25Updated 3 weeks ago
- ☆14Updated 6 months ago
- Golang web client for Ollama, fast and easy to use.☆29Updated 3 months ago
- RWKV-LM-V7(https://github.com/BlinkDL/RWKV-LM) Under Lightning Framework☆46Updated 2 weeks ago
- SVGBench: A challenging LLM benchmark that tests knowledge, coding, physical reasoning capabilities of LLMs.☆54Updated this week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆46Updated 2 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆143Updated 4 months ago
- ☆24Updated 9 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- Running Microsoft's BitNet via Electron, React & Astro☆46Updated last month
- AirLLM 70B inference with single 4GB GPU☆14Updated 4 months ago
- JotItNow is a AI Voice Notes App☆21Updated 8 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆214Updated 2 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆58Updated 11 months ago
- Shinkai allows you to create AI agents without touching code. Define tasks, schedule actions, and let Shinkai write custom code for you. …☆63Updated this week
- Inference RWKV v7 in pure C.☆41Updated last month
- Sparse Inferencing for transformer based LLMs☆201Updated 3 months ago
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆36Updated 2 months ago
- Create 3D files in the CLI with Small Language Model☆41Updated 3 weeks ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆33Updated 3 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 6 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆126Updated this week
- Booster - open accelerator for LLM models. Better inference and debugging for AI hackers☆163Updated last year
- ☆61Updated 4 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 10 months ago
- ☆49Updated last month