An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf
☆11Jul 25, 2023Updated 2 years ago
Alternatives and similar repositories for RetNet
Users that are interested in RetNet are comparing it to the libraries listed below
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Jul 20, 2023Updated 2 years ago
- ☆14Jul 26, 2023Updated 2 years ago
- ☆58Jul 9, 2024Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆56Aug 20, 2024Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆67Apr 24, 2024Updated last year
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆226Mar 12, 2024Updated last year
- A graphing calculator written in C.☆12Oct 17, 2023Updated 2 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- ☆11Nov 27, 2020Updated 5 years ago
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 4 years ago
- UCPR: User-Centric Path Reasoning towards Explainable Recommendation, SIGIR 2021☆12Jun 18, 2022Updated 3 years ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- ☆10Dec 18, 2023Updated 2 years ago
- ☆10May 11, 2017Updated 8 years ago
- The official implementation of dLLM-Factory☆26Jul 12, 2025Updated 7 months ago
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Unofficial implementation of the paper: Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Jupyterlab extension containing a UI for debugging☆10Dec 2, 2019Updated 6 years ago
- ☆18Nov 26, 2025Updated 3 months ago
- A dataset for Chinese open-domain information extraction☆14Nov 15, 2021Updated 4 years ago
- Visualize neural networks using TikZ in Julia☆15Jan 29, 2025Updated last year
- ☆11Aug 19, 2024Updated last year
- Python package for calculating Mahalanobis distances from NumPy arrays☆15Jun 22, 2022Updated 3 years ago
- ReXPlug: Explainable Recommendation using Plug and Play Language Model, SIGIR 2021☆10Nov 14, 2021Updated 4 years ago
- Showing how to use CUDA on Google Colab☆13Feb 24, 2025Updated last year
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- ☆13Dec 15, 2025Updated 2 months ago
- ☆11Apr 22, 2022Updated 3 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆14Sep 18, 2025Updated 5 months ago
- ☆38Dec 26, 2025Updated 2 months ago
- Tools and scripts for experimenting with Transformers: Bert, T5...☆61Jan 6, 2024Updated 2 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆106Nov 24, 2023Updated 2 years ago
- This is the package used to calculate the similarity index of the label graph pairs.☆13Nov 4, 2020Updated 5 years ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- YOLOv8 for strawberry disease implementation. Achieves over 10% improvement in mAP in comparison to the Mask R-CNN baseline.☆14Jul 6, 2023Updated 2 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- customizable robust Independent Component Analysis (ICA)☆12Sep 16, 2024Updated last year