louislva / deepmind-perceiver
My implementation of DeepMind's Perceiver
☆63 Updated 3 years ago
Alternatives and similar repositories for deepmind-perceiver:
Users that are interested in deepmind-perceiver are comparing it to the libraries listed below
- Implementation of Feedback Transformer in Pytorch ☆105 Updated 4 years ago
- An alternative to convolution in neural networks ☆254 Updated 11 months ago
- Lightweight Hyperparameter Optimization 🚂 ☆145 Updated 6 months ago
- Unofficial implementation of Perceiver IO ☆120 Updated 2 years ago
- ☆39 Updated 2 years ago
- Trains Transformer model variants. Data isn't shuffled between batches. ☆141 Updated 2 years ago
- ☆153 Updated 4 years ago
- A library for evaluating representations. ☆76 Updated 3 years ago
- Lightweight Cluster/Cloud VM Job Management 🚀 ☆41 Updated 6 months ago
- 🧀 Pytorch code for the Fromage optimiser. ☆123 Updated 8 months ago
- A Pytree Module system for Deep Learning in JAX ☆213 Updated 2 years ago
- Python implementation of GLN in different frameworks ☆98 Updated 4 years ago
- Cyclemoid implementation for PyTorch ☆87 Updated 2 years ago
- Check if you have training samples in your test set ☆64 Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 2 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆126 Updated 2 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆257 Updated last year
- ☆6 Updated last year
- GAN models implemented with Pytorch Lightning and Hydra configuration ☆34 Updated 2 years ago
- HetSeq: Distributed GPU Training on Heterogeneous Infrastructure ☆106 Updated last year
- Official code for the Stochastic Polyak step-size optimizer ☆139 Updated 9 months ago
- The most parameter efficient machine learning models on a few popular benchmarks ☆42 Updated 2 years ago
- a lightweight transformer library for PyTorch ☆71 Updated 3 years ago
- Python Research Framework ☆106 Updated 2 years ago
- ☆68 Updated last year
- Code for scaling Transformers ☆26 Updated 4 years ago
- API for accessing the GraphLog dataset ☆88 Updated 10 months ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 Updated 3 years ago
- Gradient Origin Networks - a new type of generative model that is able to quickly learn a latent representation without an encoder ☆161 Updated 4 years ago
- ☆98 Updated 3 years ago