Softened ROSA QKV Operators for Training Next-Generation LLM Models
☆36Apr 7, 2026Updated last week
Alternatives and similar repositories for rosa_soft
Users that are interested in rosa_soft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ROSA-Tuning☆71Feb 4, 2026Updated 2 months ago
- ☆13May 11, 2025Updated 11 months ago
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- ☆14May 15, 2023Updated 2 years ago
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Sources and examples for ASPLOS20 paper☆14Jul 21, 2020Updated 5 years ago
- Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers [https://arxiv.org/pdf/2112.04934.pdf]☆15May 13, 2023Updated 2 years ago
- Efficient RWKV inference engine. RWKV7 7.2B fp16 decoding 10250 tps @ single 5090.☆98Mar 26, 2026Updated 2 weeks ago
- Official PyTorch implementation of paper "Schema Inference for Interpretable Image Classification" (ICLR 2023)☆15Apr 6, 2023Updated 3 years ago
- This project is established for real-time training of the RWKV model.☆50May 17, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A scalable distributed graph partitioner. Ongoing research.☆15Apr 23, 2017Updated 8 years ago
- Streaming Graph Server with partitioning☆15Aug 17, 2023Updated 2 years ago
- [ICLR2026] The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym"☆27Oct 14, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Fast modular code to create and train cutting edge LLMs☆68May 16, 2024Updated last year
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass☆50Mar 21, 2026Updated 3 weeks ago
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated 2 weeks ago
- The source code of "A Comprehensive Survey and Experimental Study of Subgraph Matching: Trends, Unbiasedness, and Interaction"☆17Sep 6, 2024Updated last year
- pure go for rwkv☆18Dec 31, 2023Updated 2 years ago
- The benchmark proposed in paper: GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability☆25Aug 12, 2025Updated 8 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- The official implementation of Bi-Mamba☆15Oct 22, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Oct 11, 2023Updated 2 years ago
- Table extraction library☆31Mar 9, 2025Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆33Oct 13, 2025Updated 6 months ago
- 基于Funasr的[实时]AI语音助手☆24Dec 18, 2025Updated 3 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 6 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆57Mar 23, 2026Updated 3 weeks ago
- The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026)☆55Jan 28, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SGLang kernel library for NPU☆115Updated this week
- Prose Markup Language☆10Mar 31, 2023Updated 3 years ago
- Re:Phiedit 指南:通过重构 RPE 说明书结构,优化阅读体验,减少 RPE 说明书的理解成本☆16Apr 16, 2023Updated 2 years ago
- ☆25Apr 20, 2023Updated 2 years ago
- a.k.a autoMBW-V2☆10Sep 6, 2024Updated last year
- A Survey of Direct Preference Optimization (DPO)☆90Jul 4, 2025Updated 9 months ago
- pytorch implementation of grok☆12Apr 6, 2026Updated last week