☆14Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for retnet
Users that are interested in retnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆227Mar 12, 2024Updated 2 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆105Nov 24, 2023Updated 2 years ago
- customizable robust Independent Component Analysis (ICA)☆12Sep 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A minimal command-line utility written in Rust for querying GPU status☆24Dec 21, 2025Updated 3 months ago
- Implementation of Retention-Network in PyTorch☆17Aug 12, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- ☆17Aug 1, 2023Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- ☆22Jul 24, 2023Updated 2 years ago
- ☆11May 6, 2021Updated 4 years ago
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularl…☆26Jul 9, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Loads OpenSubtitles v2018 dataset without having to load everything into memory at once. Works well with pytorch.☆13Aug 26, 2020Updated 5 years ago
- Open Sourced ML Research Paper Implementations in Tensorflow☆19Jan 8, 2022Updated 4 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Mar 28, 2026Updated 2 weeks ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- Graph Transformers for Large Graphs☆22Apr 26, 2024Updated last year
- ☆19Oct 14, 2024Updated last year
- A Comprehensive Survey of Deep Learning for Multivariate Time Series Forecasting: A Channel Strategy Perspective☆36Jan 19, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A semantic segmentation method for high resolution image☆12Jul 1, 2022Updated 3 years ago
- ☆14Jan 22, 2025Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Official Code of Decoupled Graph Convolution (DGC)☆16Jan 31, 2026Updated 2 months ago
- Perl implementation of the Naval Research Laboratory text-to-phoneme algorithm, described by Elovitz et al (1976)☆15May 7, 2020Updated 5 years ago
- A python version of fast and robust ICA based on the paper of Aapo Hyvärinen.☆32Apr 18, 2023Updated 2 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- ☆10Jun 10, 2023Updated 2 years ago
- 用parl框架的DQN强化学习算法玩“合成大西瓜”☆14Mar 5, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Fork of HyenaDNA, a long-range genomic foundation model built with Hyena☆10Aug 14, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Official Code for the paper: "Composite Feature Selection using Deep Ensembles"☆24Mar 26, 2023Updated 3 years ago
- ☆23Oct 6, 2024Updated last year
- Dijkstra's Algorithm implemented in C/C++ using standard C, OpenMP and CUDA☆13Dec 12, 2015Updated 10 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Notebooks for RAG optimization workshop, using HackerNews data☆21Mar 27, 2024Updated 2 years ago