Scalable and Stable Parallelization of Nonlinear RNNS
☆29Oct 21, 2025Updated 4 months ago
Alternatives and similar repositories for elk
Users that are interested in elk are comparing it to the libraries listed below
Sorting:
- Parallelizing non-linear sequential models over the sequence length☆56Jun 23, 2025Updated 8 months ago
- Display tensors directly from GPU☆11Oct 12, 2025Updated 4 months ago
- ☆21Oct 22, 2025Updated 4 months ago
- ☆13Dec 15, 2025Updated 2 months ago
- nanoGPT using Equinox☆15Mar 3, 2023Updated 2 years ago
- Attention Kernels for Symmetric Power Transformers☆129Sep 25, 2025Updated 5 months ago
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- Repo for solving arc problems with an Neural Cellular Automata☆23May 21, 2025Updated 9 months ago
- ☆17Jun 11, 2025Updated 8 months ago
- FlashRNN - Fast RNN Kernels with I/O Awareness☆175Oct 20, 2025Updated 4 months ago
- ☆19Dec 4, 2025Updated 2 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆21Nov 4, 2024Updated last year
- [ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"☆53Apr 26, 2024Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆56Aug 20, 2024Updated last year
- ☆24Oct 21, 2024Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- GP Sinkhorn Implementation, paper: https://www.mdpi.com/1099-4300/23/9/1134☆23May 1, 2022Updated 3 years ago
- Psychoacoustic Calibration for Efficient Neural Audio Coding☆26Sep 26, 2023Updated 2 years ago
- speed-running solving robot manipulation tasks☆24Oct 31, 2024Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Sep 12, 2024Updated last year
- ☆30Dec 2, 2024Updated last year
- ☆27May 3, 2024Updated last year
- nanoGPT-like codebase for LLM training☆116Nov 7, 2025Updated 3 months ago
- Autoregressive Image Generation☆31Jun 13, 2025Updated 8 months ago
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 4 months ago
- ☆30Nov 5, 2023Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated last year
- ☆32May 26, 2024Updated last year
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Jun 19, 2024Updated last year
- ☆35Apr 12, 2024Updated last year
- Official implementation of the NeurIPS 24 paper of statistical flow matching (SFM) for discrete generation.☆44Nov 7, 2024Updated last year
- ☆316Jan 8, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A massively parallel, optimal functional runtime in Rust☆31Aug 7, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆38Apr 15, 2024Updated last year
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.☆43Jan 15, 2026Updated last month
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆155Oct 13, 2023Updated 2 years ago