☆14Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for retnet
Users that are interested in retnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Jul 20, 2023Updated 2 years ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- Huggingface compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf) including parallel, recurrent,…☆227Mar 12, 2024Updated 2 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆105Nov 24, 2023Updated 2 years ago
- customizable robust Independent Component Analysis (ICA)☆12Sep 16, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A minimal command-line utility written in Rust for querying GPU status☆24Dec 21, 2025Updated 5 months ago
- Implementation of Retention-Network in PyTorch☆17Aug 12, 2023Updated 2 years ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- ☆22Jul 24, 2023Updated 2 years ago
- ☆11May 6, 2021Updated 5 years ago
- ☆33Oct 21, 2022Updated 3 years ago
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"☆1,215Oct 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A graphing calculator written in c.☆13Oct 17, 2023Updated 2 years ago
- Parallel implementations of Bellman-Ford algorithm with MPI, OpenMP and CUDA.☆11Sep 25, 2018Updated 7 years ago
- A program that runs a sobel filter edge detection algorithm on an image using a single thread on the CPU, another using OpenMP to paralle…☆10Oct 18, 2017Updated 8 years ago
- MRCPSP: This is an implementation of multi-mode resource constrained project scheduling problem (MRCPSP) in MATLAB.☆11May 10, 2019Updated 7 years ago
- PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularl…☆27Jul 9, 2023Updated 2 years ago
- ☆17Apr 10, 2024Updated 2 years ago
- Open Sourced ML Research Paper Implementations in Tensorflow☆18Jan 8, 2022Updated 4 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Feb 21, 2023Updated 3 years ago
- Graph Transformers for Large Graphs☆22Apr 26, 2024Updated 2 years ago
- ☆19Oct 14, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆15Jul 22, 2024Updated last year
- code for Automatic Modulation Open Set Recognition with diffusion models☆19Jan 4, 2025Updated last year
- Official Code of Decoupled Graph Convolution (DGC)☆16Jan 31, 2026Updated 3 months ago
- A Comprehensive Survey of Deep Learning for Multivariate Time Series Forecasting: A Channel Strategy Perspective☆37Jan 19, 2026Updated 4 months ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆49Oct 21, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 用parl框架的DQN强化学习算法玩“合成大西瓜”☆14Mar 5, 2021Updated 5 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 3 years ago
- ☆34Jan 9, 2024Updated 2 years ago
- Use for Generating Radar Active Jamming Signal Modulation Dataset(11 Types).☆36Feb 11, 2026Updated 3 months ago
- ☆10May 1, 2023Updated 3 years ago
- Official Code for the paper: "Composite Feature Selection using Deep Ensembles"☆25Mar 26, 2023Updated 3 years ago
- ☆23Oct 6, 2024Updated last year