Language modeling with linear-cost context
☆118Sep 25, 2025Updated 7 months ago
Alternatives and similar repositories for retention
Users that are interested in retention are comparing it to the libraries listed below.
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 7 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 6 months ago
- ROSA-Tuning☆71Feb 4, 2026Updated 2 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 11 months ago
- Code repository for "RL Grokking Recipe: How RL Unlocks and Transfers New Algorithms in LLMs"☆33Oct 12, 2025Updated 6 months ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- ☆41Apr 30, 2025Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- ☆12Dec 21, 2024Updated last year
- A 20M RWKV v6 can do nonogram☆13Oct 18, 2024Updated last year
- ☆69Mar 21, 2025Updated last year
- Optimized primitives for collective multi-GPU communication☆10May 8, 2024Updated last year
- The purpose of this repo is to implement LLMOps as taught in the DeepLearning.AI course☆67Updated this week
- A PyTorch implementation of the shearlet transform.☆13Oct 9, 2025Updated 6 months ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 4 months ago
- ☆19Sep 29, 2024Updated last year
- Reinforcing General Reasoning without Verifiers☆99Jun 24, 2025Updated 10 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated 2 years ago
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 7 months ago
- Here we will test various linear attention designs.☆62Apr 25, 2024Updated 2 years ago
- ☆17Jan 1, 2025Updated last year
- Cloud instance management for deep learning applications.☆38Mar 1, 2022Updated 4 years ago
- LiveView comment form☆12Mar 9, 2021Updated 5 years ago
- https://www.kaggle.com/c/siim-acr-pneumothorax-segmentation☆11Sep 11, 2019Updated 6 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 8 years ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆349Feb 18, 2026Updated 2 months ago
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆24Apr 20, 2026Updated last week
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆11Jul 10, 2024Updated last year
- ☆11Jun 14, 2019Updated 6 years ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- Python script for encrypting and decrypting in the same way as an Enigma machine☆17Aug 15, 2011Updated 14 years ago
- A sample project to demonstrate precompilation using Rustler☆17Mar 26, 2026Updated last month
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆86Sep 5, 2025Updated 7 months ago
- ☆10Jul 13, 2024Updated last year
- ☆14Dec 30, 2025Updated 4 months ago
- My collection of dotfiles☆14Apr 22, 2026Updated last week
- A plug for reverse proxy server.☆16May 28, 2024Updated last year
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- Historical and operational core of the OMNIA diagnostics lineage inside the OMNIABASE ecosystem.☆74Apr 20, 2026Updated last week