louislva / deepmind-perceiver
My implementation of DeepMind's Perceiver
☆63 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for deepmind-perceiver
- Implementation of Feedback Transformer in PyTorch ☆104 · Updated 3 years ago
- Check if you have training samples in your test set ☆64 · Updated 2 years ago
- Lightweight Hyperparameter Optimization 🚂 ☆145 · Updated 2 months ago
- Lightweight Cluster/Cloud VM Job Management 🚀 ☆41 · Updated 2 months ago
- Cyclemoid implementation for PyTorch ☆87 · Updated 2 years ago
- Python implementation of GLN in different frameworks ☆95 · Updated 4 years ago
- GAN models implemented with PyTorch Lightning and Hydra configuration ☆34 · Updated 2 years ago
- ☆156 · Updated 4 years ago
- A lightweight transformer library for PyTorch ☆72 · Updated 3 years ago
- Trains Transformer model variants. Data isn't shuffled between batches. ☆142 · Updated 2 years ago
- Unofficial implementation of Perceiver IO ☆117 · Updated 2 years ago
- Neural Turing Machines in PyTorch ☆47 · Updated 2 years ago
- A Pytree Module system for Deep Learning in JAX ☆214 · Updated last year
- A GPT, made only of MLPs, in JAX ☆55 · Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 · Updated 2 years ago
- ☆66 · Updated last year
- 🧀 PyTorch code for the Fromage optimiser. ☆122 · Updated 3 months ago
- Code for scaling Transformers ☆26 · Updated 3 years ago
- Minimal standalone example of diffusion model ☆154 · Updated 2 years ago
- A library for evaluating representations. ☆76 · Updated 2 years ago
- Lightweight ML Experiment Logging 📖 ☆80 · Updated 2 months ago
- Official code for the Stochastic Polyak step-size optimizer ☆137 · Updated 4 months ago
- An active learning library for PyTorch based on Lightning-Fabric. ☆79 · Updated 6 months ago
- A collection of code snippets for my PyTorch Lightning projects ☆107 · Updated 3 years ago
- Differentiable Algorithms and Algorithmic Supervision. ☆104 · Updated last year
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆258 · Updated last year
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 · Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆78 · Updated 3 years ago
- An alternative to convolution in neural networks ☆250 · Updated 7 months ago