☆27Jul 9, 2024Updated last year
Alternatives and similar repositories for state-space-models
Users that are interested in state-space-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- gzip Predicts Data-dependent Scaling Laws☆35May 28, 2024Updated last year
- Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.☆25Sep 29, 2024Updated last year
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆12Jun 12, 2025Updated 10 months ago
- Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"☆14May 26, 2025Updated 11 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks☆56Mar 7, 2026Updated last month
- Python Wrappings for exploring Set Substitution Systems (Wolfram Models)☆16Jun 3, 2020Updated 5 years ago
- [PNAS'18] Recurrent computations for visual pattern completion: Classification of occluded images in humans and recurrent neural networks☆19Sep 11, 2018Updated 7 years ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- ☆308Jul 15, 2024Updated last year
- A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.☆10Nov 22, 2022Updated 3 years ago
- ☆18Mar 18, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- Linear Attention Sequence Parallelism (LASP)☆88Jun 4, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 6 months ago
- ☆13Mar 30, 2026Updated last month
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- ☆14Mar 31, 2024Updated 2 years ago
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- End to End Machine Learning Pipeline with scikit learn☆12Mar 10, 2021Updated 5 years ago
- ☆45Nov 1, 2025Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated last year