PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models
☆14Jul 20, 2023Updated 2 years ago
Alternatives and similar repositories for RetNet
Users who are interested in RetNet are comparing it to the libraries listed below
- An implementation of the paper "Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆14Sep 18, 2025Updated 5 months ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆226Mar 12, 2024Updated last year
- ☆44Nov 1, 2025Updated 4 months ago
- Code for running experiments and benchmarking on GNNExplainer: Generating Explanations for Graph Neural Networks☆15May 8, 2021Updated 4 years ago
- ☆12Sep 19, 2022Updated 3 years ago
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 5 years ago
- ☆11May 16, 2025Updated 9 months ago
- Hyperspectral Image Classification Using Feature Fusion Hypergraph Convolution Neural Network☆32Mar 16, 2022Updated 3 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- PyTorch for RISC-V Architecture on OpenEuler 24.03☆13Jun 27, 2024Updated last year
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- Getting started with MIMIC-III Critical Care Database☆12Mar 3, 2019Updated 6 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 8 months ago
- A graphing calculator written in C.☆12Oct 17, 2023Updated 2 years ago
- ☆10Mar 21, 2024Updated last year
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago
- A simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Frequency domain (Fast Fourier Transform) and time-frequency (wavelet transform) feature extraction from Electrocardiogram (ECG) data.☆11Apr 30, 2022Updated 3 years ago
- ☆11Aug 24, 2024Updated last year
- Interactive Continual Semantic Segmentation☆12Apr 13, 2022Updated 3 years ago
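- This project aims to build a reusable, multi-scenario decision-support intelligent Agent system. By extracting key information from user input and intelligently matching it against historical data, the system can provide personalized decision recommendations in domains such as education pathways, legal consultation, financial investment, mental health, business operations, supply chain optimization, crisis response, and intelligent customer service. The system adopts a unified decision-flow design and offers high…☆20Jul 22, 2025Updated 7 months ago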
- A simple image segmentation model called ‘my_FCN’ is compared with a conventional U-Net architecture and DeepLabV3+ on a subset of the Ci…☆12Dec 4, 2022Updated 3 years ago
- Code for Neural Networks journal paper - StoCFL: A stochastically clustered federated learning framework for Non-IID data with dynamic cl…☆12Apr 28, 2024Updated last year
- ☆18Nov 26, 2025Updated 3 months ago
- The implementation of FedCyBGD☆11Jul 19, 2024Updated last year
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Histopathology Feature Extractors (2024)☆12Jun 14, 2024Updated last year
- Draw flowcharts using natural language, based on OpenAI☆12Nov 13, 2023Updated 2 years ago
- CRUD Word documents with Python☆13Feb 5, 2026Updated 3 weeks ago
- Unofficial implementation of the paper: Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- The official implementation of dLLM-Factory☆26Jul 12, 2025Updated 7 months ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆10Apr 5, 2023Updated 2 years ago
- paper code commit-fsmafl☆10Mar 18, 2024Updated last year
- Anomaly Detection for time-series using Multilevel Wavelet Decomposition Networks.☆10Dec 11, 2019Updated 6 years ago
- ☆11Nov 16, 2023Updated 2 years ago
- Systematically master the core capabilities and best practices of Anthropic Claude, starting from scratch☆22Updated this week