misko / human_descentLinks
β37Updated last week
Alternatives and similar repositories for human_descent
Users that are interested in human_descent are comparing it to the libraries listed below
Sorting:
- π§± Modula software packageβ316Updated 4 months ago
- β21Updated 8 months ago
- Getting crystal-like representations with harmonic lossβ193Updated 8 months ago
- Graph neural networks in JAX.β68Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)β106Updated 3 weeks ago
- Implementation of Diffusion Transformer (DiT) in JAXβ298Updated last year
- β285Updated last year
- The boundary of neural network trainability is fractalβ221Updated last year
- β208Updated 4 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.β174Updated 2 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorchβ97Updated 4 months ago
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEsβ¦β140Updated 2 years ago
- β460Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 secondsβ335Updated last month
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.β105Updated 2 months ago
- β44Updated last month
- Minimal yet performant LLM examples in pure JAXβ214Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesβ149Updated 2 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)β198Updated last year
- Dion optimizer algorithmβ404Updated this week
- β27Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.β186Updated last year
- The history files when recording human interaction while solving ARC tasksβ118Updated this week
- Universal Notation for Tensor Operations in Python.β453Updated 8 months ago
- Annotated version of the Mamba paperβ492Updated last year
- WIPβ93Updated last year
- Simple Transformer in Jaxβ140Updated last year
- Efficient optimizersβ277Updated last month
- For optimization algorithm research and development.β552Updated last week
- Jax Codebase for Evolutionary Strategies at the Hyperscaleβ188Updated last month