samblouir/birdie

☆15

Alternatives and similar repositories for birdie

Users that are interested in birdie are comparing it to the libraries listed below.

automl / DeltaProduct
DeltaProduct is a new linear recurrent neural network architecture that uses products of generalized Householder matrices as state-transi…
☆15 · Oct 13, 2025 · Updated 9 months ago
Eliyas0007 / Pytorch-Intention
Unofficial implementation of paper: Exploring the Space of Key-Value-Query Models with Intention
☆12 · May 24, 2023 · Updated 3 years ago
catid / dataloader
High-performance tokenized language data-loader for Python C++ extension
☆15 · Jul 22, 2024 · Updated 2 years ago
Benjamin-Walker / structured-linear-cdes
Code for "Structured Linear CDEs: Maximally Expressive and Parallel-in-Time Sequence Models" (NeurIPS 2025, Spotlight)
☆19 · Feb 17, 2026 · Updated 5 months ago
IBM / selective-dense-state-space-model
Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on …
☆16 · Sep 18, 2025 · Updated 10 months ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33 · May 25, 2024 · Updated 2 years ago
Doraemonzzz / nanoTransNormer
☆11 · Oct 11, 2023 · Updated 2 years ago
haileyschoelkopf / triton-index
See https://github.com/cuda-mode/triton-index/ instead!
☆11 · May 8, 2024 · Updated 2 years ago
Ryu1845 / hyena-jax
Implementation of Hyena Hierarchy in JAX
☆10 · Apr 30, 2023 · Updated 3 years ago
google-deepmind / spectral_ssm
☆35 · Apr 12, 2024 · Updated 2 years ago
Rocketknight1 / minimal_lczero
A minimal reproduction of LCZero's training code, for ease of experimentation and benchmarking
☆14 · Mar 4, 2024 · Updated 2 years ago
Yinghao-Li / GnO-IE
Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"
☆16 · Mar 15, 2024 · Updated 2 years ago
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year
norxornor / modded-nanogpt-jax
NanoGPT speedrun in JAX. Originally at https://nor-git.pages.dev/modded-nanogpt-jax/
☆17 · Aug 28, 2025 · Updated 10 months ago
OpenNLPLab / ETSC-Exact-Toeplitz-to-SSM-Conversion
[EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…
☆14 · Oct 17, 2023 · Updated 2 years ago
defnecirci / InsightGraph
InsightGraph: A Visual Journey through Materials Articles
☆18 · Jul 20, 2023 · Updated 3 years ago
AlirezaMorsali / MLP-Attention
☆17 · Dec 19, 2024 · Updated last year
Benjamin-Walker / selective-ssms-and-linear-cdes
Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)
☆17 · Jan 7, 2025 · Updated last year
AllanYangZhou / midGPT
Distributed pretraining of large language models (LLMs) on cloud TPU slices, with Jax and Equinox.
☆27 · Sep 29, 2024 · Updated last year
ag1988 / dlr
The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha…
☆23 · Dec 30, 2022 · Updated 3 years ago
neo-chem / awesome-chemical-data
Curated list of known efforts in collecting and/or curating of chemical/materials data
☆24 · Dec 8, 2020 · Updated 5 years ago
ag1988 / mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆21 · Oct 11, 2024 · Updated last year
yikangshen / megablocks
☆20 · May 30, 2024 · Updated 2 years ago
rlglab / optionzero
[ICLR 2025 Oral] OptionZero: A method for autonomously discovering and utilizing options in the MuZero algorithm
☆28 · May 18, 2025 · Updated last year
myrho / bright-db
Offline-first, decentralized graph database of collaborative Web apps
☆15 · May 12, 2017 · Updated 9 years ago
beaver-lodge / manx
MLIR backend for Nx
☆14 · May 24, 2024 · Updated 2 years ago
HomebrewML / Olmax
HomebrewNLP in JAX flavour for maintainable TPU-Training
☆50 · Jan 20, 2024 · Updated 2 years ago
ethansmith2000 / TransformerExperiments
☆19 · Dec 4, 2025 · Updated 7 months ago
ghezalahmad / LLMs-for-the-Design-of-Sustainable-Concretes
☆15 · Jun 18, 2024 · Updated 2 years ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆58 · Aug 20, 2024 · Updated last year
automl / unlocking_state_tracking
Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…
☆22 · Mar 15, 2025 · Updated last year
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆62 · Apr 25, 2024 · Updated 2 years ago
M3RG-IITD / MaScQA
☆18 · Jul 25, 2025 · Updated last year
belindal / state-tracking
Code and data for paper "(How) do Language Models Track State?"
☆26 · Mar 31, 2025 · Updated last year
kazuki-irie / kv-memory-brain
Official Code Repository for the paper "Key-value memory in the brain"
☆32 · Feb 25, 2025 · Updated last year
allenai / drug-combo-extraction
☆22 · Oct 20, 2022 · Updated 3 years ago
KarelPeeters / kZero
A from-scratch general AlphaZero implementation for board games
☆35 · Jul 18, 2024 · Updated 2 years ago
jopetty / word-problem
Experiments on the impact of depth in transformers and SSMs.
☆44 · Oct 23, 2025 · Updated 9 months ago
NousResearch / StripedHyenaTrainer
☆67 · Dec 8, 2023 · Updated 2 years ago