ROSA-Tuning
☆71Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for ROSA-Tuning
Users that are interested in ROSA-Tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 6 months ago
- ☆13Dec 21, 2024Updated last year
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- Softened ROSA QKV Operators for Training Next-Generation LLM Models☆36Apr 7, 2026Updated last week
- ☆176Jan 13, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton☆49Apr 2, 2026Updated 2 weeks ago
- ☆12Dec 14, 2024Updated last year
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.☆98Mar 26, 2026Updated 3 weeks ago
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Mar 27, 2026Updated 2 weeks ago
- Helper Tool for Card Preview Modding in Clash Royale☆17Nov 3, 2025Updated 5 months ago
- ☆17Apr 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆148Nov 22, 2024Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Aug 13, 2024Updated last year
- ☆18Sep 29, 2024Updated last year
- [TMI'22] Personalized Retrogress-Resilient Federated Learning Towards Imbalanced Medical Data☆15Jul 20, 2022Updated 3 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 6 months ago
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- [ICLR 2025] Official implementation for "StringLLM: Understanding the String Processing Capability of Large Language Models"☆22Jan 23, 2025Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust.☆348Apr 1, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Jan 1, 2025Updated last year
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆53Aug 6, 2025Updated 8 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Feb 2, 2025Updated last year
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 weeks ago
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- [MICCAI 2023] GRACE: Enhancing Federated Learning for Medical Imaging with Generalized and Personalized Gradient Correction☆17Jun 29, 2023Updated 2 years ago
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆84Sep 5, 2025Updated 7 months ago
- My collection of dotfiles☆14Mar 16, 2026Updated 3 weeks ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 3 weeks ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- 直観主義の命題論理+自然演繹の中で与えられた定理を検証する遺伝的アルゴリズムを用いた証明探索エンジン☆19Mar 11, 2026Updated last month
- A port of the RWKV v7 language model, implemented with the Burn deep learning framework☆14Jun 9, 2025Updated 10 months ago
- Public female English corpus used for Project AI❤dol☆14Dec 28, 2025Updated 3 months ago
- Enemies for your LLM☆35Jan 20, 2026Updated 2 months ago
- RADLADS training code☆39May 7, 2025Updated 11 months ago