ROSA-Tuning
☆71Feb 4, 2026Updated last month
Alternatives and similar repositories for ROSA-Tuning
Users that are interested in ROSA-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 5 months ago
- ☆13Dec 21, 2024Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- continous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- ☆179Jan 13, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆48Aug 22, 2025Updated 7 months ago
- ☆12Dec 14, 2024Updated last year
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.☆95Feb 1, 2026Updated last month
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Feb 20, 2026Updated last month
- Helper Tool for Card Preview Modding in Clash Royale☆17Nov 3, 2025Updated 4 months ago
- 为 RWKV 设计的「Deep Think」实现。☆26Dec 7, 2025Updated 3 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆17Apr 14, 2025Updated 11 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Language modeling with linear-cost context☆119Sep 25, 2025Updated 6 months ago
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- ☆148Nov 22, 2024Updated last year
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 5 months ago
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- Implementation of the RWKV language model in pure WebGPU/Rust.☆346Jan 10, 2026Updated 2 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] Official implementation for "StringLLM: Understanding the String Processing Capability of Large Language Models"☆22Jan 23, 2025Updated last year
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass☆29Updated this week
- ☆17Jan 1, 2025Updated last year
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆52Aug 6, 2025Updated 7 months ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Dec 23, 2025Updated 3 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Feb 2, 2025Updated last year
- Official implementation of the TTS model Lina-Speech☆179Jan 9, 2025Updated last year
- pure go for rwkv☆19Dec 31, 2023Updated 2 years ago
- ☆14Apr 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Evaluating LLMs with Dynamic Data☆113Feb 11, 2026Updated last month
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆84Sep 5, 2025Updated 6 months ago
- My collection of dotfiles☆14Mar 16, 2026Updated last week
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- 基于Funasr的[实时]AI语音助手☆24Dec 18, 2025Updated 3 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- A port of the RWKV v7 language model, implemented with the Burn deep learning framework☆14Jun 9, 2025Updated 9 months ago