ROSA-Tuning
☆74Feb 4, 2026Updated 4 months ago
Alternatives and similar repositories for ROSA-Tuning
Users that are interested in ROSA-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆36Oct 13, 2025Updated 8 months ago
- ☆12Dec 21, 2024Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆38Apr 7, 2026Updated 2 months ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆179Jan 13, 2026Updated 5 months ago
- RADLADS training code☆44May 7, 2025Updated last year
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Apr 2, 2026Updated 2 months ago
- ☆12Dec 14, 2024Updated last year
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- GoldFinch and other hybrid transformer components☆13Dec 9, 2025Updated 6 months ago
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15May 20, 2026Updated 3 weeks ago
- Helper Tool for Card Preview Modding in Clash Royale☆17Nov 3, 2025Updated 7 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- RWKV-7 7.2B fp16 15000+ tps decoding @ single 5090☆115Jun 7, 2026Updated last week
- ☆18Apr 14, 2025Updated last year
- 为 RWKV 设计的「Deep Think」实现。☆27Dec 7, 2025Updated 6 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- A Tensorflow2.0 implementation of Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network☆10Mar 5, 2021Updated 5 years ago
- Language modeling with linear-cost context☆119Sep 25, 2025Updated 8 months ago
- Data Hiding in Image☆10Apr 9, 2020Updated 6 years ago
- ☆150Nov 22, 2024Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆147Aug 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆19Sep 29, 2024Updated last year
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 8 months ago
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- [ICLR 2025] Official implementation for "StringLLM: Understanding the String Processing Capability of Large Language Models"☆22Jan 23, 2025Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust.☆354Jun 1, 2026Updated 2 weeks ago
- ☆17Jan 1, 2025Updated last year
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆31Oct 14, 2025Updated 8 months ago
- ☆14Apr 10, 2024Updated 2 years ago
- Fix BibTeX databases with Crossref metadata☆11Nov 24, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Apr 6, 2025Updated last year
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Feb 2, 2025Updated last year
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- Master DSA With Python☆11Feb 20, 2023Updated 3 years ago
- Evaluating LLMs with Dynamic Data☆115May 9, 2026Updated last month
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆88Sep 5, 2025Updated 9 months ago