Repository for StripedHyena, a state-of-the-art beyond Transformer architecture
☆411Mar 7, 2024Updated last year
Alternatives and similar repositories for stripedhyena
Users that are interested in stripedhyena are comparing it to the libraries listed below
Sorting:
- Biological foundation modeling from molecular to genome scale☆1,479Feb 16, 2026Updated 2 weeks ago
- Pretraining infrastructure for multi-hybrid AI model architectures☆202Feb 20, 2026Updated last week
- Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena☆764Apr 22, 2025Updated 10 months ago
- ☆62Dec 8, 2023Updated 2 years ago
- [ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome☆459Jan 1, 2026Updated 2 months ago
- Understand and test language model architectures on synthetic tasks.☆257Feb 24, 2026Updated last week
- Inference and numerics for multi-hybrid AI model architectures☆92Dec 16, 2025Updated 2 months ago
- Genome modeling and design across all domains of life☆3,336Updated this week
- Convolutions for Sequence Modeling☆913Jun 13, 2024Updated last year
- Reference implementation of Megalodon 7B model☆528May 17, 2025Updated 9 months ago
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍☆942Mar 3, 2024Updated 2 years ago
- ☆16Feb 14, 2025Updated last year
- Effect of tokenization on transformers for biological sequence☆22Dec 31, 2025Updated 2 months ago
- FAPLM: A Drop-in Efficient Pytorch Implementation of Protein Language Models☆166Jul 30, 2025Updated 7 months ago
- Benchmarks for classification of genomic sequences☆172Aug 14, 2025Updated 6 months ago
- Bilingual Language Model for Protein Sequence and Structure☆301Jan 2, 2025Updated last year
- Pytorch implementation of the Borzoi model from Calico, and Flashzoi, a 3x faster Borzoi enhancement.☆97Nov 13, 2025Updated 3 months ago
- An annotated implementation of the Hyena Hierarchy paper☆34May 28, 2023Updated 2 years ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"☆248Jun 6, 2025Updated 8 months ago
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"☆562Dec 28, 2024Updated last year
- ☆45Feb 11, 2026Updated 3 weeks ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- ☆16Mar 1, 2025Updated last year
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores☆343Dec 28, 2024Updated last year
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆372Dec 12, 2024Updated last year
- ☆2,253Jan 26, 2026Updated last month
- Generation of protein sequences and evolutionary alignments via discrete diffusion models☆663Jan 15, 2026Updated last month
- Structure-conditioned masked language modeling for protein sequence design☆71Jan 31, 2024Updated 2 years ago
- Bi-Directional Equivariant Long-Range DNA Sequence Modeling☆226Jun 17, 2025Updated 8 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆138Dec 17, 2024Updated last year
- Mamba SSM architecture☆17,257Feb 18, 2026Updated 2 weeks ago
- ☆74Oct 19, 2024Updated last year
- ☆23Jul 8, 2025Updated 7 months ago
- Mamba support for transformer lens☆19Sep 17, 2024Updated last year
- 🧬 Generative modeling of regulatory DNA sequences with diffusion probabilistic models 💨☆465Dec 22, 2025Updated 2 months ago
- Saprot: Protein Language Model with Structural Alphabet (AA+3Di)☆567Nov 19, 2025Updated 3 months ago
- ☆868Dec 8, 2023Updated 2 years ago
- RNA-seq prediction with deep convolutional neural networks.☆227Aug 28, 2025Updated 6 months ago
- ☆32Jan 1, 2024Updated 2 years ago