francoisfleuret / picogptLinks
Minimal GPT (~350 lines with a simple task to test it)
☆63Updated last month
Alternatives and similar repositories for picogpt
Users that are interested in picogpt are comparing it to the libraries listed below
Sorting:
- Getting crystal-like representations with harmonic loss☆194Updated 9 months ago
- The boundary of neural network trainability is fractal☆221Updated last year
- ☆212Updated last year
- ☆44Updated 2 months ago
- Graph neural networks in JAX.☆68Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆61Updated last year
- Because we don't want a jupyter notebook mess...☆61Updated 6 months ago
- Diffusion models in PyTorch☆120Updated 2 weeks ago
- ☆60Updated 3 years ago
- σ-GPT: A New Approach to Autoregressive Models☆70Updated last year
- ☆158Updated 2 months ago
- Jax like function transformation engine but micro, microjax☆34Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆97Updated 5 months ago
- A practical guide to diffusion models, implemented from scratch.☆232Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆149Updated 3 months ago
- ☆32Updated 5 months ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆188Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- Fast singularity detection with kernel☆37Updated 2 years ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…☆74Updated 6 months ago
- ☆53Updated last year
- ☆34Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆76Updated 2 months ago
- Generative cellular automaton-like learning environments for RL.☆20Updated 11 months ago
- Induce brain-like topographic structure in your neural networks☆71Updated 5 months ago
- Code for "Training-free Graph Neural Networks and the Power of Labels as Features" (TMLR 2024)☆57Updated last year
- Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.☆174Updated 2 years ago
- Repository to create traveling waves integrate special information through time☆56Updated 5 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆77Updated 2 years ago
- Implementation of Diffusion Transformer (DiT) in JAX☆300Updated last year