misko / human_descent
โ36Updated last month
Alternatives and similar repositories for human_descent:
Users that are interested in human_descent are comparing it to the libraries listed below
- Graph neural networks in JAX.โ67Updated 7 months ago
- ๐งฑ Modula software packageโ134Updated this week
- โ40Updated 2 months ago
- A package for defining deep learning models using categorical algebraic expressions.โ59Updated 6 months ago
- Simple Transformer in Jaxโ130Updated 7 months ago
- โ58Updated 2 years ago
- Flow-matching algorithms in JAXโ83Updated 5 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)โ45Updated 2 months ago
- โ204Updated 6 months ago
- Your favourite classical machine learning algos on the GPU/TPUโ20Updated 3 weeks ago
- A MAD laboratory to improve AI architecture designs ๐งชโ102Updated last month
- supporting pytorch FSDP for optimizersโ75Updated last month
- โ150Updated last month
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resourcesโ119Updated 2 weeks ago
- โ53Updated last year
- ฯ-GPT: A New Approach to Autoregressive Modelsโ61Updated 5 months ago
- Tensor Network Library with Autogradโ158Updated this week
- โ50Updated 3 months ago
- Exact OU processes with JAXโ38Updated 4 months ago
- โ46Updated 2 months ago
- โ27Updated 6 months ago
- An introduction to LLM Samplingโ75Updated last month
- Pytorch-like dataloaders in JAX.โ72Updated 3 months ago
- Implementation of Diffusion Transformer (DiT) in JAXโ261Updated 7 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.โ91Updated 2 months ago
- โ48Updated 11 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT trainingโ121Updated 9 months ago
- This is a port of Mistral-7B model in JAXโ30Updated 6 months ago
- Because we don't want a jupyter notebook mess...โ59Updated last month
- The history files when recording human interaction while solving ARC tasksโ96Updated this week