m-a-n-i-f-e-s-t / retentionView external linksLinks
Language modeling with linear-cost context
☆116Sep 25, 2025Updated 4 months ago
Alternatives and similar repositories for retention
Users that are interested in retention are comparing it to the libraries listed below
Sorting:
- ROSA-Tuning☆66Feb 4, 2026Updated 2 weeks ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 5 months ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆32Oct 13, 2025Updated 4 months ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 7 months ago
- ☆67Mar 21, 2025Updated 10 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Apr 30, 2024Updated last year
- ☆27Aug 6, 2024Updated last year
- ☆29Feb 27, 2024Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated 9 months ago
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Jan 27, 2026Updated 3 weeks ago
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 2 months ago
- ☆41Apr 30, 2025Updated 9 months ago
- Cloud instance management for deep learning applications.☆38Mar 1, 2022Updated 3 years ago
- Pseudopotential converter from upf to psp8☆11Jan 25, 2023Updated 3 years ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆42Mar 11, 2025Updated 11 months ago
- ☆39Apr 27, 2024Updated last year
- This repo contains documentation related to the operation of the OpenBytes project.☆13Oct 29, 2021Updated 4 years ago
- ☆35Updated this week
- A relatively simple, unified method for reporting on Kubernetes resource issues.☆12Mar 5, 2020Updated 5 years ago
- Evaluation of Oasis Platform - simple install, UI and API☆14Feb 9, 2026Updated last week
- Matrix Product State algorithm for computing characters of the symmetric group S_n☆11Sep 26, 2025Updated 4 months ago
- ☆10Nov 5, 2022Updated 3 years ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆12Dec 3, 2024Updated last year
- Attention Kernels for Symmetric Power Transformers☆129Sep 25, 2025Updated 4 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated 11 months ago
- ☆15Nov 9, 2024Updated last year
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Open-source intelligence (OSINT)☆15Mar 1, 2024Updated last year
- Here is my implementation of Center Loss with Keras☆11May 2, 2018Updated 7 years ago
- ETL project to download and process both CME open interest data, COT data from the CFTC and NAV/shares-outstanding data from various ETF …☆12Jul 13, 2021Updated 4 years ago
- BNG Image Format Implementation☆12Sep 19, 2020Updated 5 years ago
- [MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"☆14Nov 1, 2024Updated last year
- ☆13Dec 8, 2025Updated 2 months ago
- Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more☆12Updated this week
- 사용자인증 API서비스☆10Apr 21, 2021Updated 4 years ago
- Counterfactual Explanation Based on Gradual Construction for Deep Networks Pytorch☆11Apr 7, 2021Updated 4 years ago