PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.
☆89Mar 1, 2024Updated 2 years ago
Alternatives and similar repositories for S4Torch
Users that are interested in S4Torch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Structured state space sequence models☆2,873Jul 17, 2024Updated last year
- ☆35Nov 22, 2024Updated last year
- Implementation of https://srush.github.io/annotated-s4☆514Jun 20, 2025Updated 9 months ago
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆82Apr 26, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆22Jan 22, 2024Updated 2 years ago
- Selective Copying Task with Mamba Model. This repository contains a simple implementation for reproducing the selective copying task with…☆12Jun 3, 2024Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- ☆317Jan 8, 2025Updated last year
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 9 months ago
- Official code base for NeurIPS 2021 SVRHM Workshop poster "On the use of Cortical Magnification and Saccades as Biological Proxies for Da…☆13Jun 28, 2023Updated 2 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- ☆12May 14, 2024Updated last year
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Reading list for research topics in state-space models☆355Jun 11, 2025Updated 9 months ago
- ☆68Oct 22, 2024Updated last year
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se…☆68Apr 24, 2024Updated last year
- ☆63Jul 11, 2023Updated 2 years ago
- JAX exponential map normalising flows on sphere☆17Oct 4, 2020Updated 5 years ago
- Simple, minimal implementation of the Mamba SSM in one pytorch file. Using logcumsumexp (Heisen sequence).☆132Oct 18, 2024Updated last year
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆29Jul 9, 2024Updated last year
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆70Mar 11, 2022Updated 4 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- ☆17Dec 19, 2024Updated last year
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- ☆24Sep 25, 2024Updated last year
- Official repository for the paper "Automating Continual Learning"☆18Jun 11, 2025Updated 9 months ago
- ☆18Oct 26, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆40Nov 11, 2024Updated last year
- ☆22Jul 12, 2021Updated 4 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- ☆16Jul 11, 2023Updated 2 years ago
- Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.☆13Jul 19, 2024Updated last year
- Lib for event and sample based performance metrics☆24Jan 16, 2026Updated 2 months ago