Possibly useful materials for learning the RWKV language model.
☆26Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for RWKV-howto
Users that are interested in RWKV-howto are comparing it to the libraries listed below.
- ☆12Sep 16, 2024Updated last year
- JAX implementations of RWKV☆19Sep 26, 2023Updated 2 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 2 months ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- A simple API, written mainly by ChatGPT, for accessing ChatRWKV with simple requests☆15May 19, 2023Updated 2 years ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 4 months ago
- Local inference for Llama and Tongyi Qianwen (Qwen) large models, implemented with Apple Metal☆10Apr 26, 2024Updated last year
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- ☆10May 12, 2022Updated 3 years ago
- ☆21Oct 10, 2025Updated 5 months ago
- The official implementation of HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization☆18Mar 7, 2025Updated last year
- Code for the paper: "T-shape data and probabilistic remaining useful life prediction for Li-ion batteries using multiple non-crossing qua…☆10Aug 4, 2023Updated 2 years ago
- ☆15Jul 13, 2025Updated 8 months ago
- ☆19Dec 12, 2023Updated 2 years ago
- Ἀνατομή is a PyTorch library to analyze representation of neural networks☆13Jan 31, 2024Updated 2 years ago
- Don't just regulate gradients like in Muon, regulate the weights too☆32Jul 30, 2025Updated 8 months ago
- Implementation of Unified Embedding: Battle-Tested Feature Representations for Web-Scale ML Systems☆14Nov 11, 2023Updated 2 years ago
- Unofficial Scalable-Softmax Is Superior for Attention☆20May 30, 2025Updated 9 months ago
- Contains the code for the paper "Multi-Horizon Short-Term Load Forecasting Using Hybrid of LSTM and Modified Split Convolution"☆11Oct 28, 2023Updated 2 years ago
- RWKV centralised docs for the community☆32Jan 17, 2026Updated 2 months ago
- This app forecasts the live traffic for the next 3 hours in the famous streets of Paris. Additionally, it also provides statistics for th…☆13Jul 16, 2024Updated last year
- Code for paper "DB-LSTM: Densely-Connected Bi-directional LSTM for Human Action Recognition"☆13Jul 1, 2022Updated 3 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Personal solutions to the Triton Puzzles☆20Jul 18, 2024Updated last year
- ☆12Mar 19, 2021Updated 5 years ago
- call rwkv v4/v5/v6/v7 raven/world/finch 1B5-14B rwkv.cpp using csharp cpu/gpu (support INT4,8,Float16,32)☆36Feb 21, 2025Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34May 30, 2023Updated 2 years ago
- Conditional Linear Dynamical Systems☆16Oct 7, 2025Updated 5 months ago
- A Flask-based API for the RWKV_Role_Playing project.☆31Jun 26, 2024Updated last year
- ☆16Feb 6, 2024Updated 2 years ago
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.☆67Sep 14, 2022Updated 3 years ago
- 1.2% test error on MNIST using only least squares and numpy calls.☆21Sep 13, 2023Updated 2 years ago
- Code for PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization, NeurIPS 2022☆19Nov 23, 2022Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 11 months ago
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 2 years ago
- The official implementation of dLLM-Var☆31Nov 6, 2025Updated 4 months ago
- u-MPS implementation and experimentation code used in the paper Tensor Networks for Probabilistic Sequence Modeling (https://arxiv.org/ab…☆19Jul 2, 2020Updated 5 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆30Oct 13, 2024Updated last year
- ☆26Dec 3, 2023Updated 2 years ago