misko / human_descentLinks
β37Updated last month
Alternatives and similar repositories for human_descent
Users that are interested in human_descent are comparing it to the libraries listed below
Sorting:
- Implementation of Diffusion Transformer (DiT) in JAXβ305Updated last year
- π§± Modula software packageβ322Updated 5 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)β107Updated 2 months ago
- β214Updated 3 weeks ago
- β21Updated 9 months ago
- β289Updated last year
- The boundary of neural network trainability is fractalβ221Updated last year
- Getting crystal-like representations with harmonic lossβ195Updated 9 months ago
- β45Updated 2 months ago
- Minimal yet performant LLM examples in pure JAXβ233Updated 2 weeks ago
- Graph neural networks in JAX.β68Updated last year
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEsβ¦β141Updated 2 years ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.β114Updated last month
- β540Updated 5 months ago
- Universal Notation for Tensor Operations in Python.β463Updated 9 months ago
- A simple library for scaling up JAX programsβ144Updated 2 months ago
- Dion optimizer algorithmβ420Updated 2 weeks ago
- For optimization algorithm research and development.β558Updated 2 weeks ago
- Simple Transformer in Jaxβ142Updated last year
- Compositional Linear Algebraβ506Updated 5 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 secondsβ349Updated 2 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesβ150Updated 3 months ago
- Solve puzzles. Learn CUDA.β63Updated 2 years ago
- seqax = sequence modeling + JAXβ170Updated 6 months ago
- WIPβ93Updated last year
- The history files when recording human interaction while solving ARC tasksβ117Updated this week
- An implementation of PSGD Kron second-order optimizer for PyTorchβ98Updated 6 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)β198Updated last year
- Deep Learning, an Energy Approachβ239Updated 7 months ago
- β490Updated last year