Possibly useful materials for learning the RWKV language model.
☆26Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-howto
Users who are interested in RWKV-howto are comparing it to the libraries listed below.
- ☆12Sep 16, 2024Updated last year
- A Go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated 2 years ago
- A simple API, written mainly by ChatGPT, that accesses ChatRWKV through simple requests☆15May 19, 2023Updated 3 years ago
- All-in-one benchmarking platform for evaluating LLMs.☆15Nov 12, 2025Updated 6 months ago
- Local inference for Llama and Tongyi Qianwen (Qwen) large models, implemented with Apple Metal☆10Apr 26, 2024Updated 2 years ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- ☆10May 12, 2022Updated 4 years ago
- BlinkDL's RWKV-v4 running in the browser☆48Mar 2, 2023Updated 3 years ago
- Code for the paper: "T-shape data and probabilistic remaining useful life prediction for Li-ion batteries using multiple non-crossing qua…☆10Aug 4, 2023Updated 2 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- ☆23Oct 10, 2025Updated 7 months ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 9 months ago
- ☆18Sep 27, 2022Updated 3 years ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆15Nov 11, 2023Updated 2 years ago
- A Zen approach to configuring your Python project☆17Feb 27, 2026Updated 3 months ago
- Code for TIP 2024 paper: Sparse Coding Inspired LSTM and Self-Attention Integration for Medical Image Segmentation☆13Oct 28, 2024Updated last year
- Python implementation of AWarp algorithm☆14Aug 6, 2021Updated 4 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 11 months ago
- This app forecasts the live traffic for the next 3 hours in the famous streets of Paris. Additionally, it also provides statistics for th…☆13Jul 16, 2024Updated last year
- RWKV centralised docs for the community☆34Jan 17, 2026Updated 4 months ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- ☆12Mar 19, 2021Updated 5 years ago
- Resources and programs to generate models (URDF, SDF) of the iCub robot☆15Mar 18, 2026Updated 2 months ago
- Call rwkv.cpp for RWKV v4/v5/v6/v7 (Raven/World/Finch, 1B5-14B) from C# on CPU/GPU (supports INT4, INT8, Float16, Float32)☆36Feb 21, 2025Updated last year
- Conditional Linear Dynamical Systems☆17Oct 7, 2025Updated 7 months ago
- Personal solutions to the Triton Puzzles☆21Jul 18, 2024Updated last year
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆17Dec 18, 2024Updated last year
- Official implementation of our paper "Exploring the Potential of Diffusion Large Language Models in Code Generation".☆22Oct 29, 2025Updated 7 months ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆67Sep 14, 2022Updated 3 years ago
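- A multimodal image-text dialogue model implementing Blip2RWKV+QFormer. Using the Two-Step Cognitive Psychology Prompt method, a model with only 3B parameters can exhibit human-like causal chains of thought. Benchmarked against image-text dialogue LLMs such as MiniGPT-4 and ImageBind, it strives to achieve, with less compute and fewer resources…☆42Jul 17, 2023Updated 2 years ago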
- PyTorch implementation of "Towards k-means-friendly spaces: Simultaneous deep learning and clustering," Bo Yang et al., 2017.☆17Jan 15, 2021Updated 5 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated last year
- The official implementation of dLLM-Var☆34Nov 6, 2025Updated 6 months ago
- Continuous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- 1.2% test error on MNIST using only least squares and numpy calls.☆22Sep 13, 2023Updated 2 years ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- ☆26Dec 3, 2023Updated 2 years ago