☆14Jul 26, 2023Updated 2 years ago
Alternatives and similar repositories for retnet
Users that are interested in retnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆11Jul 25, 2023Updated 2 years ago
- PyTorch implementation of Retentive Network: A Successor to Transformer for Large Language Models☆14Jul 20, 2023Updated 2 years ago
- A simple but robust PyTorch implementation of RetNet from "Retentive Network: A Successor to Transformer for Large Language Models" (http…☆106Nov 24, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- ☆22Jul 24, 2023Updated 2 years ago
- ☆11May 6, 2021Updated 4 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"☆1,214Oct 22, 2023Updated 2 years ago
- A graphing calculator written in c.☆12Oct 17, 2023Updated 2 years ago
- Parallel implementations of Bellman-Ford algorithm with MPI, OpenMP and CUDA.☆11Sep 25, 2018Updated 7 years ago
- A PyTorch implementation of MixNet: Mixed Depthwise Convolutional Kernels☆11Aug 5, 2019Updated 6 years ago
- PyTorch implementation of the NCDSSM models presented in the ICML '23 paper "Neural Continuous-Discrete State Space Models for Irregularl…☆26Jul 9, 2023Updated 2 years ago
- Official source codes of airsep☆39Mar 26, 2024Updated last year
- Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery☆16Apr 28, 2024Updated last year
- yolov5 TensorRT implementation running on Nvidia Jetson AGX Xavier with RealSense D435☆15Mar 10, 2021Updated 5 years ago
- This is the accompanying website to 'Diff-a-Riff: Musical Accompaniment Co-creation via Latent Diffusion Models'☆14Nov 4, 2024Updated last year
- Implementation and analysis using CUDA and openMP☆12Dec 14, 2016Updated 9 years ago
- Defending AI-Based Automatic Modulation Recognition Models Against Adversarial Attacks☆11Jan 11, 2025Updated last year
- Open Sourced ML Research Paper Implementations in Tensorflow☆19Jan 8, 2022Updated 4 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆13Jan 17, 2024Updated 2 years ago
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- Graph Transformers for Large Graphs☆22Apr 26, 2024Updated last year
- [CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning☆15Sep 23, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- A semantic segmentation method for high resolution image☆12Jul 1, 2022Updated 3 years ago
- Neural Processing Letters: End-to-End Entity Detection with Proposer and Regressor☆12Jun 6, 2023Updated 2 years ago
- ☆14Jan 19, 2024Updated 2 years ago
- [CVPR'24] Solving the Catastrophic Forgetting Problem in Generalized Category Discovery https://arxiv.org/pdf/2501.05272☆16Dec 24, 2024Updated last year
- code for Automatic Modulation Open Set Recognition with diffusion models☆17Jan 4, 2025Updated last year
- Official Code of Decoupled Graph Convolution (DGC)☆16Jan 31, 2026Updated last month
- ☆33Jan 9, 2024Updated 2 years ago
- A large-scale RWKV v7(World, PRWKV, Hybrid-RWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy…☆48Oct 21, 2025Updated 5 months ago
- ☆10Jun 10, 2023Updated 2 years ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago