an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for RetNet
Users that are interested in RetNet are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Jul 20, 2023Updated 2 years ago
- ☆14Jul 26, 2023Updated 2 years ago
- customizable robust Independent Component Analysis (ICA)☆12Sep 16, 2024Updated last year
- Implementation of Retention-Network in PyTorch☆17Aug 12, 2023Updated 2 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆56Aug 20, 2024Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- CVPR2023: AttriCLIP: A Non-Incremental Learner for Incremental Knowledge Learning☆18May 19, 2023Updated 2 years ago
- ☆58Jul 9, 2024Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆67Apr 24, 2024Updated last year
- ☆24Apr 7, 2024Updated last year
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆25Jan 6, 2025Updated last year
- Jupyterlab extension containing a UI for debugging☆10Dec 2, 2019Updated 6 years ago
- OrientDB Database Interface for Erlang☆11Mar 7, 2014Updated 12 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆226Mar 12, 2024Updated 2 years ago
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆106Nov 24, 2023Updated 2 years ago
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Elixir/Phoenix collaborative drawing board demonstration project☆18Jan 3, 2023Updated 3 years ago
- Benchmarks from the RELEASE project☆12Jun 8, 2016Updated 9 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- Consistent Prompting for Rehearsal-Free Continual Learning [CVPR2024]☆35Jun 12, 2025Updated 9 months ago
- ☆19Oct 14, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al (1976)☆15May 7, 2020Updated 5 years ago
- A python version of fast and robust ICA based on the paper of Aapo Hyvärinen.☆32Apr 18, 2023Updated 2 years ago
- ☆33Jan 9, 2024Updated 2 years ago
- 😜Constrative Learning of Sentence Embedding using LoRA (EECS487 final project)☆13Apr 19, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 4 months ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Description: Frequency Augmented Variational Autoencoder for better Image Reconstruction☆44Oct 11, 2023Updated 2 years ago
- ☆10May 11, 2017Updated 8 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Active development for Erlang: rebuild and reload source/binary files while the VM is running☆52Dec 15, 2017Updated 8 years ago
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 4 years ago
- Notebooks for RAG optimization workshop, using HackerNews data☆21Mar 27, 2024Updated last year
- ☆10Dec 18, 2023Updated 2 years ago