npk48 / rwkv_cuda
☆11Updated last year
Alternatives and similar repositories for rwkv_cuda:
Users that are interested in rwkv_cuda are comparing it to the libraries listed below
- A converter and basic tester for rwkv onnx☆42Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Course Project for COMP4471 on RWKV☆17Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆11Updated 3 months ago
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago
- Inference RWKV with multiple supported backends.☆43Updated this week
- RWKV centralised docs for the community☆24Updated last month
- A highly customizable, full scale web backend for web-rwkv, built on axum with websocket protocol.☆26Updated last year
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven☆13Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- Training a reward model for RLHF using RWKV.☆14Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Updated 2 years ago
- Download full or partial git-lfs repos without temporarily using 2x disk space☆31Updated last year
- Experiments with BitNet inference on CPU☆54Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆50Updated 2 weeks ago
- Port of Suno AI's Bark in C/C++ for fast inference☆53Updated last year
- ChatGPT-like Web UI for RWKVstic☆19Updated 2 years ago
- 👷 Build compute kernels☆37Updated this week
- Web browser version of StarCoder.cpp☆45Updated last year
- asynchronous/distributed speculative evaluation for llama3☆39Updated 9 months ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!☆20Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆46Updated 2 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆22Updated last month
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated last year
- ☆16Updated 11 months ago
- Thin wrapper around GGML to make life easier☆27Updated last week