A 20M RWKV v6 can do nonogram
☆14Oct 18, 2024Updated last year
Alternatives and similar repositories for RWKV-nonogram
Users that are interested in RWKV-nonogram are comparing it to the libraries listed below
Sorting:
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆13Dec 21, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 6 months ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- ☆17Jan 1, 2025Updated last year
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆47Oct 21, 2025Updated 4 months ago
- ☆27Feb 26, 2026Updated last week
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- RADLADS training code☆37May 7, 2025Updated 9 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…☆31Jan 28, 2026Updated last month
- ☆12Sep 30, 2018Updated 7 years ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆30Jan 28, 2026Updated last month
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 4 months ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆70Jan 12, 2026Updated last month
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆73May 26, 2024Updated last year
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆133Jul 20, 2024Updated last year
- [EMNLP 2024] RWKV-CLIP: A Robust Vision-Language Representation Learner☆153Dec 14, 2025Updated 2 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- Parse command line arguments by defining dataclasses☆13Oct 13, 2024Updated last year
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 4 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 8 years ago
- Balanced K-means in Pytorch with strong GPU acceleration☆12Apr 30, 2020Updated 5 years ago
- ☆14Updated this week
- ☆11Feb 28, 2024Updated 2 years ago
- ☆12Jul 7, 2022Updated 3 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- Teaching Categories to Human Learners with Visual Explanations - CVPR 2018☆11Jun 21, 2022Updated 3 years ago
- Thaumcraft 4 Addon☆13Oct 10, 2025Updated 4 months ago