misko / human_descentLinks
β38Updated 8 months ago
Alternatives and similar repositories for human_descent
Users that are interested in human_descent are comparing it to the libraries listed below
Sorting:
- Implementation of Diffusion Transformer (DiT) in JAXβ290Updated last year
- π§± Modula software packageβ216Updated 2 weeks ago
- Getting crystal-like representations with harmonic lossβ193Updated 4 months ago
- The boundary of neural network trainability is fractalβ215Updated last year
- Graph neural networks in JAX.β67Updated last year
- β275Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.β172Updated 2 years ago
- β21Updated 4 months ago
- β174Updated 4 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.β394Updated 4 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesβ143Updated 2 months ago
- Compositional Linear Algebraβ487Updated last week
- β115Updated 2 months ago
- Bare-bones implementations of some generative models in Jax: diffusion, normalizing flows, consistency models, flow matching, (beta)-VAEsβ¦β132Updated last year
- β43Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)β190Updated last year
- Annotated version of the Mamba paperβ487Updated last year
- Flow-matching algorithms in JAXβ100Updated last year
- For optimization algorithm research and development.β524Updated this week
- Latent Program Network (from the "Searching Latent Program Spaces" paper)β93Updated 5 months ago
- β442Updated 9 months ago
- β144Updated this week
- Ο-GPT: A New Approach to Autoregressive Modelsβ67Updated 11 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.β184Updated 10 months ago
- Use Jax functions in Pytorchβ248Updated 2 years ago
- Minimal GPT (~350 lines with a simple task to test it)β62Updated 7 months ago
- Dion optimizer algorithmβ259Updated this week
- A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.β290Updated 11 months ago
- Simple Transformer in Jaxβ138Updated last year
- The history files when recording human interaction while solving ARC tasksβ114Updated last week