PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models
☆14Jul 20, 2023Updated 2 years ago
Alternatives and similar repositories for RetNet
Users that are interested in RetNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆227Mar 12, 2024Updated 2 years ago
- customizable robust Independent Component Analysis (ICA)☆12Sep 16, 2024Updated last year
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …☆16Sep 18, 2025Updated 6 months ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 5 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Interactive Continual Semantic Segmentation☆12Apr 13, 2022Updated 3 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆16Feb 15, 2023Updated 3 years ago
- Python interface and preprocessing pipeline for the BBBC021 dataset of cellular images☆14Sep 19, 2021Updated 4 years ago
- Sparse representation solvers for P0- and P1-problems☆10Nov 30, 2023Updated 2 years ago
- t-Distributed Stochastic Neighbor Embedding applyed on the hyperspectral dataset and the generated feature maps.☆11Jun 2, 2022Updated 3 years ago
- ☆15Jul 22, 2024Updated last year
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Hypergraph Structured Deep Auto-encoders for Hyperspectral Image Clustering and Semi-Supervised Classification☆27Oct 9, 2021Updated 4 years ago
- ☆19Jan 7, 2026Updated 3 months ago
- A simple image segmentation model called ‘my_FCN’ is compared with a conventional U-Net architecture and DeepLabV3+ on a subset of the Ci…☆12Dec 4, 2022Updated 3 years ago
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Multidimensional RNN in Keras Tensorflow☆20Feb 24, 2020Updated 6 years ago
- ☆20Apr 8, 2025Updated last year
- ☆19Oct 14, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A python version of fast and robust ICA based on the paper of Aapo Hyvärinen.☆32Apr 18, 2023Updated 2 years ago
- [CVPR 2025] Custom Open CLIP repo to train biomedical CLIP models☆35Mar 23, 2025Updated last year
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Hyperspectral Image Classification Using Feature Fusion Hypergraph Convolution Neural Network☆30Mar 16, 2022Updated 4 years ago
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- An implementation of the neural network described in "Convolution Based Spectral Partitioning Architecture for Hyperspectral Image Classi…☆15Jul 5, 2020Updated 5 years ago
- 使用自然语言绘制流程图,基于OpenAI☆12Nov 13, 2023Updated 2 years ago
- ☆51Jan 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Text extractor/injector for Tsukihime Remake☆38Sep 4, 2023Updated 2 years ago
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU☆14Jul 22, 2023Updated 2 years ago
- ☆10Dec 18, 2023Updated 2 years ago
- A super aggregator for some other free v2ray aggregators☆14Updated this week
- Implementing Pyramid Scene Parsing Network (PSPNet) paper using Pytorch☆16Sep 1, 2020Updated 5 years ago
- ☆13Dec 15, 2025Updated 3 months ago
- ☆50Jan 28, 2025Updated last year