srush / Autodiff-Puzzles
☆428Updated 5 months ago
Alternatives and similar repositories for Autodiff-Puzzles:
Users that are interested in Autodiff-Puzzles are comparing it to the libraries listed below
- Puzzles for exploring transformers☆342Updated last year
- What would you do with 1000 H100s...☆1,035Updated last year
- An interactive exploration of Transformer programming.☆262Updated last year
- ☆215Updated 9 months ago
- A puzzle to learn about prompting☆126Updated last year
- For optimization algorithm research and development.☆505Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆565Updated this week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement…☆375Updated this week
- Compositional Linear Algebra☆472Updated 2 weeks ago
- Annotated version of the Mamba paper☆481Updated last year
- Solve puzzles. Learn CUDA.☆63Updated last year
- 🧱 Modula software package☆187Updated 2 weeks ago
- A Jax-based library for designing and training small transformers.☆286Updated 7 months ago
- Puzzles for learning Triton☆1,566Updated 4 months ago
- seqax = sequence modeling + JAX☆153Updated last week
- Implementation of Diffusion Transformer (DiT) in JAX☆270Updated 10 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆529Updated last month
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆237Updated 2 weeks ago
- Named tensors with first-class dimensions for PyTorch☆320Updated last year
- Extract full next-token probabilities via language model APIs☆240Updated last year
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆365Updated last week
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆286Updated 4 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆784Updated last month
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆129Updated last year
- ☆419Updated 9 months ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX☆177Updated 2 months ago
- TensorDict is a pytorch dedicated tensor container.☆906Updated this week
- UNet diffusion model in pure CUDA☆601Updated 9 months ago
- Uncertainty quantification with PyTorch☆349Updated last week
- CLU lets you write beautiful training loops in JAX.☆337Updated this week