Language modeling with linear-cost context
☆117Sep 25, 2025Updated 6 months ago
Alternatives and similar repositories for retention
Users that are interested in retention are comparing it to the libraries listed below.
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆56Mar 31, 2026Updated last week
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 6 months ago
- ROSA-Tuning☆71Feb 4, 2026Updated 2 months ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆33Oct 13, 2025Updated 5 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆13Dec 21, 2024Updated last year
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆14Oct 18, 2024Updated last year
- Official Chinese documentation for RWKV | RWKV官方中文文档☆15Mar 27, 2026Updated 2 weeks ago
- ☆68Mar 21, 2025Updated last year
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- RWKV v5, v6 LoRA Trainer on CUDA and ROCm platforms. RWKV is an RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- The purpose of this repo is to implement LLMOps as shared in the DeepLearning.AI course☆61Apr 2, 2026Updated last week
- CS341 for Spring 2024☆11Jul 15, 2024Updated last year
- ☆30Feb 27, 2024Updated 2 years ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 3 months ago
- ☆18Sep 29, 2024Updated last year
- ☆13Aug 9, 2023Updated 2 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆22Oct 14, 2025Updated 5 months ago
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 9 months ago
- A program that allows you to chat on VRChat using ChatGPT.☆15Mar 22, 2023Updated 3 years ago
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 6 months ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated last year
- ☆17Jan 1, 2025Updated last year
- LiveView comment form☆12Mar 9, 2021Updated 5 years ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆341Feb 18, 2026Updated last month
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 8 years ago
- Attention Kernels for Symmetric Power Transformers☆130Sep 25, 2025Updated 6 months ago
- ☆11Jun 14, 2019Updated 6 years ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- KVN: Keypoints Voting Network with Differentiable RANSAC for Stereo Pose Estimation☆12Mar 4, 2024Updated 2 years ago
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆84Sep 5, 2025Updated 7 months ago
- ☆10Jul 13, 2024Updated last year
- ☆14Dec 30, 2025Updated 3 months ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- MB-X.01 · Logical Origin Node (L.O.N.) — TruthΩ → Co⁺ → Score⁺. Verifiable demo and spec. https://massimiliano.neocities.org/☆71Apr 3, 2026Updated last week
- ChatGPT Clone in LiveView☆17Jun 3, 2023Updated 2 years ago