possibly useful materials for learning RWKV language model.
☆26Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-howto
Users that are interested in RWKV-howto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 16, 2024Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 5 months ago
- BlinkDL's RWKV-v4 running in the browser☆48Mar 2, 2023Updated 3 years ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆19Mar 7, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the paper: "T-shape data and probabilistic remaining useful life prediction for Li-ion batteries using multiple non-crossing qua…☆10Aug 4, 2023Updated 2 years ago
- ☆14Jul 13, 2025Updated 9 months ago
- ☆19Dec 12, 2023Updated 2 years ago
- Official Pytorch implementation of Chromatic Graph Transformers☆10Jun 14, 2023Updated 2 years ago
- Clustered Compositional Embeddings☆12Oct 25, 2023Updated 2 years ago
- ☆23Oct 10, 2025Updated 6 months ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 8 months ago
- ☆32Mar 30, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Contains the code for the paper "Multi-Horizon Short-Term Load Forecasting Using Hybrid of LSTM and Modified Split Convolution"☆11Oct 28, 2023Updated 2 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 3 months ago
- Code for paper "DB-LSTM: Densely-Connected Bi-directional LSTM for Human Action Recognition"☆13Jul 1, 2022Updated 3 years ago
- ☆12Jan 17, 2024Updated 2 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆22Nov 22, 2024Updated last year
- Script and instruction how to fine-tune large RWKV model on your data for Alpaca dataset☆30Apr 2, 2023Updated 3 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- Least Squares Regression for subspace clustering☆10May 27, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Resources and programs to generated models (URDF, SDF) of the iCub robot☆15Mar 18, 2026Updated last month
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆16Dec 18, 2024Updated last year
- call rwkv v4/v5/v6/v7 raven/world/finch 1B5-14B rwkv.cpp using csharp cpu/gpu (support INT4,8,Float16,32)☆36Feb 21, 2025Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34May 30, 2023Updated 2 years ago
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- MLOps Model Factory is an end to end workflow that supports generating multiple models and used for deployment to any target.☆10May 9, 2024Updated last year
- Write your own simple OS☆19Jun 2, 2015Updated 10 years ago
- Conditional Linear Dynamical Systems☆16Oct 7, 2025Updated 6 months ago
- 一个基于Flask实现的RWKV_Role_Playing项目的API。☆32Jun 26, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆44Mar 29, 2023Updated 3 years ago
- PyTorch implementation of "Towards k-means-friendly spaces: Simultaneous deep learning and clustering," Bo Yang et al., 2017.☆17Jan 15, 2021Updated 5 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆19Nov 23, 2022Updated 3 years ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- Several common methods of matrix multiplication are implemented on CPU and Nvidia GPU using C++11 and CUDA.☆14Feb 8, 2023Updated 3 years ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- 1.2% test error on MNIST using only least squares and numpy calls.☆22Sep 13, 2023Updated 2 years ago