misko / human_descent
★37 · Updated 2 weeks ago
Alternatives and similar repositories for human_descent
Users who are interested in human_descent are comparing it to the libraries listed below.
- 🧱 Modula software package · ★299 · Updated 2 months ago
- Getting crystal-like representations with harmonic loss · ★192 · Updated 7 months ago
- Implementation of Diffusion Transformer (DiT) in JAX · ★294 · Updated last year
- Graph neural networks in JAX. · ★68 · Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper) · ★102 · Updated last month
- ★283 · Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources · ★147 · Updated last month
- ★44 · Updated 2 months ago
- ★20 · Updated 7 months ago
- Simple Transformer in Jax · ★139 · Updated last year
- The boundary of neural network trainability is fractal · ★217 · Updated last year
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEs… · ★136 · Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) · ★193 · Updated last year
- Minimal yet performant LLM examples in pure JAX · ★187 · Updated last month
- Compositional Linear Algebra · ★490 · Updated 3 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable. · ★173 · Updated 2 years ago
- Universal Notation for Tensor Operations in Python. · ★442 · Updated 6 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch · ★96 · Updated 3 months ago
- WIP · ★93 · Updated last year
- Annotated version of the Mamba paper · ★490 · Updated last year
- Deep Learning, an Energy Approach · ★218 · Updated 4 months ago
- ★197 · Updated 2 months ago
- Automatic gradient descent · ★215 · Updated 2 years ago
- The history files when recording human interaction while solving ARC tasks · ★117 · Updated last week
- ★456 · Updated last year
- ★28 · Updated last month
- Because we don't want a jupyter notebook mess... · ★61 · Updated 4 months ago
- Efficient optimizers · ★276 · Updated 2 weeks ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds · ★321 · Updated 3 months ago
- For optimization algorithm research and development. · ★542 · Updated this week