misko / human_descent
☆37 Updated 10 months ago
Alternatives and similar repositories for human_descent
Users that are interested in human_descent are comparing it to the libraries listed below
- 🧱 Modula software package ☆282 Updated last month
- Implementation of Diffusion Transformer (DiT) in JAX ☆292 Updated last year
- ☆20 Updated 6 months ago
- Getting crystal-like representations with harmonic loss ☆193 Updated 6 months ago
- The boundary of neural network trainability is fractal ☆217 Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆98 Updated 2 weeks ago
- ☆283 Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch ☆96 Updated 2 months ago
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable. ☆173 Updated 2 years ago
- Graph neural networks in JAX. ☆68 Updated last year
- ☆193 Updated last month
- ☆28 Updated 2 weeks ago
- Minimal yet performant LLM examples in pure JAX ☆184 Updated 3 weeks ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism. ☆100 Updated 2 weeks ago
- Dion optimizer algorithm ☆361 Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆147 Updated last week
- Deep Learning, an Energy Approach ☆215 Updated 4 months ago
- Because we don't want a jupyter notebook mess... ☆61 Updated 4 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆305 Updated 2 months ago
- ☆508 Updated 2 months ago
- Compositional Linear Algebra ☆491 Updated 2 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆68 Updated last year
- Universal Notation for Tensor Operations in Python. ☆434 Updated 6 months ago
- Efficient optimizers ☆269 Updated this week
- Simple Transformer in Jax ☆139 Updated last year
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more. ☆293 Updated last year
- The history files when recording human interaction while solving ARC tasks ☆116 Updated last week
- ☆44 Updated 2 months ago
- For optimization algorithm research and development. ☆539 Updated this week
- ☆454 Updated 11 months ago