Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆91May 2, 2026Updated last week
Alternatives and similar repositories for rwkv-qualcomm
Users who are interested in rwkv-qualcomm are comparing it to the libraries listed below.
- Inference RWKV with multiple supported backends.☆88May 2, 2026Updated last week
- Infer RWKV on NCNN☆49Sep 3, 2024Updated last year
- ☆12Feb 20, 2026Updated 2 months ago
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Apr 16, 2026Updated 3 weeks ago
- ☆125Dec 15, 2023Updated 2 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 3 months ago
- ☆33Jul 23, 2024Updated last year
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆35Mar 9, 2026Updated 2 months ago
- ☆41Apr 30, 2025Updated last year
- ☆10Aug 18, 2023Updated 2 years ago
- Run Chinese MobileBert model on SNPE.☆15May 19, 2023Updated 2 years ago
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆41Jul 14, 2025Updated 9 months ago
- Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton☆48Apr 2, 2026Updated last month
- UIE (Universal Information Extraction) inference with ncnn☆15Sep 22, 2024Updated last year
- Let's use Qualcomm NPU in Android☆20Feb 18, 2025Updated last year
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆29Jan 1, 2025Updated last year
- A fast RWKV Tokenizer written in Rust☆54Aug 12, 2025Updated 8 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆26Feb 26, 2026Updated 2 months ago
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆133Jul 20, 2024Updated last year
- A project for QNN quantization and CPU inference of YOLOv5 in the Qualcomm AI Engine Direct environment☆17Sep 10, 2024Updated last year
- Awesome RWKV Prompts: user-friendly, ready-to-use RWKV prompt examples for general users.☆34Apr 13, 2026Updated 3 weeks ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- QAI AppBuilder is designed to help developers easily execute models on WoS and Linux platforms. It encapsulates the Qualcomm® AI Runtime …☆155Apr 28, 2026Updated last week
- BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling☆33Jul 11, 2025Updated 9 months ago
- Whisper in TensorRT-LLM☆17Sep 21, 2023Updated 2 years ago
- ☆188Apr 24, 2026Updated 2 weeks ago
- A 100% locally run AI web tool for generating WeChat replies using the RWKV runner☆10Oct 29, 2024Updated last year
- ☆14May 11, 2025Updated 11 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆57Mar 31, 2026Updated last month
- ☆11Oct 30, 2021Updated 4 years ago
- High-speed and easy-use LLM serving framework for local deployment☆150Aug 7, 2025Updated 9 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts).☆97Oct 8, 2025Updated 7 months ago
- RWKV-LM-V7 (https://github.com/BlinkDL/RWKV-LM) under the Lightning framework☆59Dec 24, 2025Updated 4 months ago
- ☆177Jan 13, 2026Updated 3 months ago
- LLM inference in C/C++☆51May 2, 2026Updated last week