Language modeling with linear-cost context
☆118Sep 25, 2025Updated 7 months ago
Alternatives and similar repositories for retention
Users that are interested in retention are comparing it to the libraries listed below.
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 8 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆57Mar 31, 2026Updated last month
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 7 months ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆35Oct 13, 2025Updated 7 months ago
- ☆41Apr 30, 2025Updated last year
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- ☆69Mar 21, 2025Updated last year
- RWKV v5, v6 LoRA Trainer on CUDA and ROCm platforms. RWKV is an RNN with transformer-level LLM performance. It can be directly trained like …☆13Mar 24, 2024Updated 2 years ago
- The purpose of this repo is to implement LLMOps as shared in the DeepLearning.AI course☆67May 13, 2026Updated last week
- Implementation of BitNet-1.58 instruct tuning☆29Apr 14, 2024Updated 2 years ago
- ☆30Feb 27, 2024Updated 2 years ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 5 months ago
- Compression performance of BPG, JPEG, JPEG2000 and Webp.☆12May 15, 2019Updated 7 years ago
- ☆19Sep 29, 2024Updated last year
- A PyTorch implementation of the shearlet transform.☆14Oct 9, 2025Updated 7 months ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Reinforcing General Reasoning without Verifiers☆100Jun 24, 2025Updated 10 months ago
- ☆14Aug 9, 2023Updated 2 years ago
- Decompose skin into its independent components: haemoglobin and melanin☆10Jan 6, 2015Updated 11 years ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- ☆17Jan 1, 2025Updated last year
- Cloud instance management for deep learning applications.☆38Mar 1, 2022Updated 4 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 8 years ago
- Reactively track user's online, offline, and idle statuses☆10Jun 3, 2022Updated 3 years ago
- Attention Kernels for Symmetric Power Transformers☆130Sep 25, 2025Updated 7 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆355Feb 18, 2026Updated 3 months ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆11Jul 10, 2024Updated last year
- ☆11Jun 14, 2019Updated 6 years ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Python script for encrypting and decrypting in the same way as an Enigma machine☆17Aug 15, 2011Updated 14 years ago
- A sample project to demonstrate precompilation using Rustler☆17Mar 26, 2026Updated last month
- ☆10Jul 13, 2024Updated last year
- 🔍 Code Search Tools & Experiments☆12May 4, 2026Updated 2 weeks ago
- ☆14Dec 30, 2025Updated 4 months ago
- Code for EMNLP 2023 paper: DALE: Generative Data Augmentation for Low-Resource Legal NLP☆10Oct 27, 2023Updated 2 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- Vibe coding in emacs with amp☆20Jun 15, 2025Updated 11 months ago