francoisfleuret / dlc
☆41Updated last month
Alternatives and similar repositories for dlc:
Users that are interested in dlc are comparing it to the libraries listed below
- Your favourite classical machine learning algos on the GPU/TPU☆20Updated last month
- supporting pytorch FSDP for optimizers☆76Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 3 months ago
- Simple Transformer in Jax☆136Updated 7 months ago
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆53Updated 2 months ago
- An introduction to LLM Sampling☆75Updated 2 months ago
- ☆75Updated 7 months ago
- ☆20Updated 9 months ago
- ☆32Updated 2 weeks ago
- ☆27Updated 7 months ago
- A basic pure pytorch implementation of flash attention☆16Updated 3 months ago
- ☆53Updated last year
- ☆47Updated 2 months ago
- Minimal but scalable implementation of large language models in JAX☆32Updated 3 months ago
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆25Updated last week
- LLM training in simple, raw C/CUDA☆14Updated 2 months ago
- ☆211Updated 7 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆122Updated 10 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆59Updated 6 months ago
- NanoGPT (124M) quality in 2.67B tokens☆27Updated this week
- 🧱 Modula software package☆145Updated this week
- ☆40Updated 2 months ago
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 7 months ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆83Updated last week
- A MAD laboratory to improve AI architecture designs 🧪☆102Updated 2 months ago
- ☆25Updated last year
- ☆158Updated 2 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆24Updated 8 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆119Updated last week