smvorwerk / xlstm-cudaView external linksLinks
Cuda implementation of Extended Long Short Term Memory (xLSTM) with C++ and PyTorch ports
☆91Jun 10, 2024Updated last year
Alternatives and similar repositories for xlstm-cuda
Users that are interested in xlstm-cuda are comparing it to the libraries listed below
Sorting:
- ☆60May 28, 2024Updated last year
- Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.☆304Jun 28, 2024Updated last year
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Feb 9, 2026Updated last week
- Resources about xLSTM by Sepp Hochreiter☆318Nov 13, 2024Updated last year
- xLSTMTime for time series forecasting☆184Nov 25, 2024Updated last year
- my attempts at implementing various bits of Sepp Hochreiter's new xLSTM architecture☆133May 8, 2024Updated last year
- xLSTM as Generic Vision Backbone☆491Oct 20, 2025Updated 3 months ago
- MRC-LSTM: A Hybrid Approach of Multi-scale Residual CNN and LSTM to Predict Bitcoin Price☆11Jun 13, 2022Updated 3 years ago
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆28Nov 11, 2024Updated last year
- Pytorch implementation of the Gato paper from Deepmind☆12Feb 8, 2023Updated 3 years ago
- Hierarchical multi-system training framework for dynamical systems reconstruction (from Brenner et al. 2025 ICLR)☆17Mar 7, 2025Updated 11 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- Official and maintained implementation of the paper "Multi-StyleGAN: Towards Image-Based Simulation of Time-Lapse Live-Cell Microscopy" […☆11Mar 28, 2022Updated 3 years ago
- ☆33Oct 22, 2024Updated last year
- A easy to use implementation of xLSTM☆66Sep 3, 2025Updated 5 months ago
- Unofficial PyTorch Implementation of "Were RNNs All We Needed?"☆17Mar 20, 2025Updated 10 months ago
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated last year
- ToeffiPy is a PyTorch like autograd/deep learning library based only on NumPy.☆16Mar 28, 2022Updated 3 years ago
- Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%☆16Jun 20, 2023Updated 2 years ago
- Implementation of RL-Enabled Distributed Assignment (REDA)☆27Jul 9, 2024Updated last year
- DyEdgeGAT: Dynamic Edge via Graph Attention for Early Fault Detection in IIoT Systems☆19Sep 10, 2025Updated 5 months ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…☆23Dec 30, 2022Updated 3 years ago
- Terraform framework for deploying [elizaos/eliza, swarms] ai agents☆25Jun 19, 2025Updated 7 months ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆22Nov 15, 2020Updated 5 years ago
- PyTorch reimplementation of the DiracGAN proposed in the paper "Which Training Methods for GANs do actually Converge?" [ICML 2018].☆17Jul 12, 2021Updated 4 years ago
- Stable Diffusion Video to Video, Image to Image, Template Prompt Generation system and more, for use with any stable diffusion model☆23Sep 14, 2022Updated 3 years ago
- ☆20Jul 3, 2023Updated 2 years ago
- Implementations of various linear RNN layers using pytorch and triton☆54Aug 4, 2023Updated 2 years ago
- Official and maintained implementation of the paper "OSS-Net: Memory Efficient High Resolution Semantic Segmentation of 3D Medical Data" …☆26Apr 17, 2023Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Feb 9, 2026Updated last week
- Use OpenAI, Redis, and streamlit to recommend hotels using Large Language Models☆27Apr 15, 2025Updated 10 months ago
- 2D Convolutional KAN Layers with different types of activation functions☆11Sep 29, 2024Updated last year
- Library for Event Synchronization and Event Coincidence Analysis☆14Jan 12, 2025Updated last year
- This is an official implementation for "Are Transformers Effective for Time Series Forecasting?"☆27Jul 27, 2023Updated 2 years ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Feb 10, 2026Updated last week
- Code support as a published paper!☆72Apr 29, 2024Updated last year
- Implementation of related angular-margin-based classification loss functions for training (face) embedding models: SphereFace, CosFace, A…☆26May 21, 2024Updated last year
- ☆32Jan 7, 2024Updated 2 years ago