misko / human_descentLinks
☆36Updated 7 months ago
Alternatives and similar repositories for human_descent
Users that are interested in human_descent are comparing it to the libraries listed below
Sorting:
- 🧱 Modula software package☆207Updated 3 months ago
- Getting crystal-like representations with harmonic loss☆192Updated 3 months ago
- ☆21Updated 3 months ago
- ☆43Updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆91Updated 4 months ago
- The boundary of neural network trainability is fractal☆210Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX☆279Updated last year
- ☆167Updated 3 months ago
- Simple Transformer in Jax☆138Updated last year
- ☆274Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated 11 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆140Updated last month
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆172Updated 2 years ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆385Updated 3 months ago
- The history files when recording human interaction while solving ARC tasks☆113Updated this week
- ☆55Updated 7 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- Minimal GPT (~350 lines with a simple task to test it)☆62Updated 7 months ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025)☆206Updated 2 months ago
- For optimization algorithm research and development.☆521Updated this week
- Graph neural networks in JAX.☆67Updated last year
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆263Updated 4 months ago
- ☆408Updated last week
- ☆135Updated this week
- Because we don't want a jupyter notebook mess...☆61Updated last month
- ☆200Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆147Updated 3 weeks ago
- DeMo: Decoupled Momentum Optimization☆189Updated 7 months ago
- ☆440Updated 9 months ago
- Automatic gradient descent☆208Updated 2 years ago