An interactive exploration of Transformer programming.
☆274 Nov 15, 2023 Updated 2 years ago
Alternatives and similar repositories for raspy
Users who are interested in raspy are comparing it to the libraries listed below.
- ☆498 Oct 18, 2024 Updated last year
- ☆552 Feb 5, 2024 Updated 2 years ago
- Puzzles for exploring transformers ☆386 May 4, 2023 Updated 2 years ago
- ☆28 Jan 12, 2022 Updated 4 years ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,869 Jun 22, 2025 Updated 8 months ago
- ☆26 Mar 11, 2025 Updated 11 months ago
- Solve puzzles. Improve your pytorch. ☆3,966 Jul 15, 2024 Updated last year
- A puzzle to learn about prompting ☆135 May 12, 2023 Updated 2 years ago
- train with kittens! ☆63 Oct 25, 2024 Updated last year
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Jul 27, 2024 Updated last year
- A declarative drawing API in Python ☆299 Aug 28, 2024 Updated last year
- What would you do with 1000 H100s... ☆1,155 Jan 10, 2024 Updated 2 years ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search. ☆67 Jan 31, 2026 Updated last month
- Implementation of https://srush.github.io/annotated-s4 ☆512 Jun 20, 2025 Updated 8 months ago
- Fork of Flame repo for training of some new stuff in development ☆19 Feb 27, 2026 Updated last week
- ☆16 Nov 1, 2023 Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆71 Sep 25, 2024 Updated last year
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers… ☆35 Apr 8, 2023 Updated 2 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models ☆77 Jan 24, 2024 Updated 2 years ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆570 Aug 7, 2025 Updated 6 months ago
- Annotated version of the Mamba paper ☆497 Feb 27, 2024 Updated 2 years ago
- ☆20 Jun 6, 2025 Updated 9 months ago
- Jax implementation of x-LSTM: Extended Long Short-Term Memory by Beck et al. (2024) ☆17 Aug 6, 2024 Updated last year
- generative programming & verification ☆34 Jun 19, 2025 Updated 8 months ago
- ☆23 Jan 27, 2025 Updated last year
- ☆292 Jul 15, 2024 Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆88 Sep 12, 2025 Updated 5 months ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆59 Jan 12, 2023 Updated 3 years ago
- A collection of my machine learning notebooks to run on google colab. Mostly ml art. ☆20 Jun 10, 2022 Updated 3 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆199 May 28, 2024 Updated last year
- A tiny library for coding with large language models. ☆1,233 Jul 10, 2024 Updated last year
- Simple and readable code for training and sampling from diffusion models ☆711 Jun 14, 2025 Updated 8 months ago
- The Energy Transformer block, in JAX ☆64 Dec 14, 2023 Updated 2 years ago
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels. ☆87 Updated this week
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa. ☆60 Mar 31, 2022 Updated 3 years ago
- [ICCV 2025, Highlight] Official Pytorch implementation of the paper: "ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mi… ☆36 Aug 1, 2025 Updated 7 months ago
- Generate images from texts. In Russian ☆19 Dec 13, 2021 Updated 4 years ago
- This github repository hosts the code used within my thesis work and my last publication. ☆12 Jul 20, 2017 Updated 8 years ago
- ☆26 Jun 5, 2024 Updated last year