okarthikb/state-space-models

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/okarthikb/state-space-models)

okarthikb / state-space-models

☆27

Alternatives and similar repositories for state-space-models

Users that are interested in state-space-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KhoomeiK / complexity-scaling
View on GitHub
gzip Predicts Data-dependent Scaling Laws
☆35May 28, 2024Updated 2 years ago
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
watcl-lab / positional_attention
View on GitHub
Source code for the paper "Positional Attention: Expressivity and Learnability of Algorithmic Computation"
☆14May 26, 2025Updated last year
catid / spectral_ssm
View on GitHub
Implementation of Spectral State Space Models
☆16Feb 23, 2024Updated 2 years ago
mag- / gpu_benchmark
View on GitHub
Gpu benchmark
☆79Jan 28, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
stockeh / mlx-grokking
View on GitHub
Grokking on modular arithmetic in less than 150 epochs in MLX
☆15Oct 24, 2024Updated last year
johanndiep / mistral_hackathon
View on GitHub
This repository stores the source code for the Mistral Hackathon 2024 in Paris
☆17Aug 23, 2024Updated last year
changyi7231 / NFE
View on GitHub
A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.
☆10Nov 22, 2022Updated 3 years ago
prs-eth / LoRA-Ensemble
View on GitHub
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks
☆55Mar 7, 2026Updated 4 months ago
phcerdan / wolfram_model
View on GitHub
Python Wrappings for exploring Set Substitution Systems (Wolfram Models)
☆16Jun 3, 2020Updated 6 years ago
kreimanlab / occlusion-classification
View on GitHub
[PNAS'18] Recurrent computations for visual pattern completion: Classification of occluded images in humans and recurrent neural networks
☆19Sep 11, 2018Updated 7 years ago
talmolab / track-mjx
View on GitHub
☆16Jul 17, 2026Updated last week
catid / lllm
View on GitHub
Latent Large Language Models
☆19Aug 24, 2024Updated last year
sekstini / basedxl
View on GitHub
☆18Mar 18, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rezabyt / digit-addition-491p
View on GitHub
☆15Feb 24, 2026Updated 5 months ago
google-deepmind / nanodo
View on GitHub
☆304Jul 15, 2024Updated 2 years ago
EleutherAI / rnngineering
View on GitHub
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33May 25, 2024Updated 2 years ago
OpenNLPLab / LASP
View on GitHub
Linear Attention Sequence Parallelism (LASP)
☆87Jun 4, 2024Updated 2 years ago
rayking99 / BlockStar
View on GitHub
A star for organising blocks and playing with transformers.
☆23Apr 28, 2024Updated 2 years ago
VITA-Group / WeLore
View on GitHub
[ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications
☆52Oct 30, 2025Updated 8 months ago
er537 / whisper_interpretability
View on GitHub
A repo to do interpretability of pre-trained acoustic models
☆15Oct 15, 2023Updated 2 years ago
shangdatalab / Deep-Contam
View on GitHub
Official implementation of Data Contamination Can Cross Language Barriers
☆12Sep 11, 2024Updated last year
berlino / seq_icl
View on GitHub
☆54May 20, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
VatsaDev / NanoPoor
View on GitHub
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Apr 22, 2025Updated last year
ajhalthor / scikit-learn-pipeline
View on GitHub
End to End Machine Learning Pipeline with scikit learn
☆12Mar 10, 2021Updated 5 years ago
swairshah / Intensify
View on GitHub
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12Oct 11, 2024Updated last year
kyegomez / Qwen-VL
View on GitHub
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆13Jan 29, 2024Updated 2 years ago
mdering / CoreMLZoo
View on GitHub
A few models converted from caffe to CoreMLs format.
☆15Jun 6, 2017Updated 9 years ago
vvvm23 / mamba-jax
View on GitHub
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆94Jan 25, 2024Updated 2 years ago
nicknochnack / PyReft
View on GitHub
☆16May 5, 2024Updated 2 years ago
dimarkov / pybefit
View on GitHub
Probabilistic inference for models of behaviour
☆13Mar 5, 2026Updated 4 months ago
haraschax / nograd
View on GitHub
Gradient descent is cool and all, but what if we could delete it?
☆107Aug 20, 2025Updated 11 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ekinakyurek / gpt3-arithmetic
View on GitHub
Scratchpad/Chain-of-Thought Prompts
☆12Jun 6, 2022Updated 4 years ago
coderaashir / Crypto-Pairs-Trading
View on GitHub
A Statistical Arbitrage Strategy to trade Cryptocurrency Pairs
☆14Nov 6, 2020Updated 5 years ago
joey00072 / microjax
View on GitHub
Jax like function transformation engine but micro, microjax
☆34Oct 25, 2024Updated last year
yacineMTB / just-large-models
View on GitHub
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Sep 6, 2023Updated 2 years ago
ckkissane / sae-transfer
View on GitHub
Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"
☆13Jul 18, 2024Updated 2 years ago
varunshenoy / smalltalk
View on GitHub
A browser extension that demos Gemini Nano via window.ai and Cartesia TTS ⚡️
☆38Jul 10, 2024Updated 2 years ago
christophmark / bayesianfridge
View on GitHub
Sequential Monte Carlo sampler for PyMC2 models.
☆14Apr 4, 2018Updated 8 years ago