A 20M RWKV v6 can do nonogram
☆14Oct 18, 2024Updated last year
Alternatives and similar repositories for RWKV-nonogram
Users that are interested in RWKV-nonogram are comparing it to the libraries listed below.
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆13Dec 21, 2024Updated last year
- ☆27Feb 26, 2026Updated 3 weeks ago
- Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton☆48Aug 22, 2025Updated 7 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆32Mar 9, 2026Updated 2 weeks ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- Parse command line arguments by defining dataclasses☆13Oct 13, 2024Updated last year
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 5 months ago
- ☆17Jan 1, 2025Updated last year
- continuous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- RADLADS training code☆37May 7, 2025Updated 10 months ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆22Dec 17, 2025Updated 3 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆153Dec 14, 2025Updated 3 months ago
- Experiments on the impact of depth in transformers and SSMs.☆41Oct 23, 2025Updated 5 months ago
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Feb 20, 2026Updated last month
- Model Context Protocol (MCP) library for the D language☆13Sep 14, 2025Updated 6 months ago
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- An AI Agent using MoonBit☆12Nov 29, 2024Updated last year
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆133Jul 20, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated last year
- A free open-source visual novel engine written in D.☆24Jan 17, 2026Updated 2 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆56Updated this week
- Inference RWKV with multiple supported backends.☆82Mar 11, 2026Updated 2 weeks ago
- Thaumcraft 4 Addon☆13Mar 15, 2026Updated last week
- ☆14Updated this week
- Linux terminal app to change screen color temperature.☆12Jul 11, 2024Updated last year
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- Fixed- and floating-point Kalman filters for resource-constrained environments, written in Rust.☆19Jun 24, 2025Updated 9 months ago
- ☆18Sep 29, 2024Updated last year
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 5 months ago
- WebGPU demo written in D☆16Jul 30, 2025Updated 7 months ago
- ☆13Jan 17, 2024Updated 2 years ago
- A plugin for Hatch that runs build scripts and saves their artifacts.☆25May 29, 2025Updated 9 months ago
- An implementation of Anthropic's paper and essay on "A statistical approach to model evaluations"☆17Oct 6, 2025Updated 5 months ago