RL10x/RetNet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RL10x/RetNet)

RL10x / RetNet

an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf

☆11

Alternatives and similar repositories for RetNet

Users that are interested in RetNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShaderManager / RetNet
View on GitHub
PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models
☆14Jul 20, 2023Updated 3 years ago
prateekstark / retnet
View on GitHub
☆14Jul 26, 2023Updated 2 years ago
CRG-CNAG / robustica
View on GitHub
customizable robust Independent Component Analysis (ICA)
☆12Sep 16, 2024Updated last year
myscience / retnet-pytorch
View on GitHub
Implementation of Retention-Network in PyTorch
☆17Aug 12, 2023Updated 2 years ago
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
vanity1129 / AttriCLIP
View on GitHub
CVPR2023: AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning
☆18May 19, 2023Updated 3 years ago
TalnUPF / ConceptExtraction
View on GitHub
☆11Aug 15, 2023Updated 2 years ago
CVIR / ConvPrompt
View on GitHub
☆24Apr 7, 2024Updated 2 years ago
markendo / downscaling_intelligence
View on GitHub
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
☆25Mar 21, 2026Updated 3 months ago
OpenNLPLab / HGRN
View on GitHub
[NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…
☆68Apr 24, 2024Updated 2 years ago
QuantStack / jupyterlab-debugger
View on GitHub
Jupyterlab extension containing a UI for debugging
☆10Dec 2, 2019Updated 6 years ago
aleksmeshr / odi
View on GitHub
OrientDB Database Interface for Erlang
☆11Mar 7, 2014Updated 12 years ago
HazyResearch / prefix-linear-attention
View on GitHub
☆62Jul 9, 2024Updated 2 years ago
agemoai / arcsolver
View on GitHub
A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.
☆28Jan 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
release-project / benchmarks
View on GitHub
Benchmarks from the RELEASE project
☆13Jun 8, 2016Updated 10 years ago
syncdoth / RetNet
View on GitHub
Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…
☆227Mar 12, 2024Updated 2 years ago
fkodom / yet-another-retnet
View on GitHub
A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…
☆105Nov 24, 2023Updated 2 years ago
adam-younes / calculator
View on GitHub
A graphing calculator written in c.
☆15Oct 17, 2023Updated 2 years ago
arjan / scribble
View on GitHub
Elixir/Phoenix collaborative drawing board demonstration project
☆18Jan 3, 2023Updated 3 years ago
MiniXC / opensubtitles-dataloader
View on GitHub
Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.
☆13Aug 26, 2020Updated 5 years ago
uwiger / gen_leader_revival
View on GitHub
A project to unify various implementations of the Erlang library gen_leader into a modern, robust single implementation
☆15Aug 2, 2011Updated 14 years ago
Felix-Yan / FastICA
View on GitHub
A python version of fast and robust ICA based on the paper of Aapo Hyvärinen.
☆32Apr 18, 2023Updated 3 years ago
wesg52 / llm-context-neurons
View on GitHub
Find context neurons in Pythia models.
☆13Jun 13, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Zhanxin-Gao / CPrompt
View on GitHub
Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]
☆34Jun 12, 2025Updated last year
LAIR-RCC / ruadapt
View on GitHub
☆14Jan 17, 2024Updated 2 years ago
catid / dataloader
View on GitHub
High-performance tokenized language data-loader for Python C++ extension
☆15Jul 22, 2024Updated last year
kyegomez / HSSS
View on GitHub
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…
☆16Nov 11, 2024Updated last year
alicenet / alicenet
View on GitHub
Official repository for the AliceNet layer2 blockchain
☆18Feb 14, 2026Updated 5 months ago
jiaowoguanren0615 / RetNet_ViT-RMT-
View on GitHub
☆34Jan 9, 2024Updated 2 years ago
lucidrains / hyena-dna
View on GitHub
Fork of HyenaDNA, a long-range genomic foundation model built with Hyena
☆10Aug 14, 2023Updated 2 years ago
rob-brown / ParallEx
View on GitHub
Parallel collections for Elixir
☆18Oct 30, 2014Updated 11 years ago
Eliyas0007 / Pytorch-Intention
View on GitHub
Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention
☆12May 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
greg-kennedy / p5-NRL-TextToPhoneme
View on GitHub
Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al (1976)
☆17May 7, 2020Updated 6 years ago
tic-top / LoraCSE
View on GitHub
😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)
☆13Apr 19, 2023Updated 3 years ago
gmongaras / Cottention_Transformer
View on GitHub
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20Nov 15, 2025Updated 8 months ago
vijeth8 / lda2vec-featurizer
View on GitHub
☆10May 11, 2017Updated 9 years ago
raylsnetwork / axyl
View on GitHub
Rayls L1 node with reth + bullshark + narwhal
☆28Updated this week
TaiMingLu / know-dont-tell
View on GitHub
☆19Oct 14, 2024Updated last year
Hakanaou / deepLuna
View on GitHub
Text extractor/injector for Tsukihime Remake
☆39Sep 4, 2023Updated 2 years ago