openai / LHOPT
Learned Hyperparameter Optimizers
☆58 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for LHOPT
- Code for the paper "Understanding RL Vision" ☆43 · Updated last year
- Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020. ☆121 · Updated last year
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆78 · Updated 3 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading". ☆38 · Updated 3 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch ☆72 · Updated 3 years ago
- Submissions for AI and Efficiency SOTA's ☆56 · Updated 4 years ago
- Python Research Framework ☆107 · Updated 2 years ago
- ☆66 · Updated 3 years ago
- A GPT, made only of MLPs, in Jax ☆55 · Updated 3 years ago
- ☆66 · Updated last year
- JAX implementation ViT-VQGAN ☆55 · Updated 2 years ago
- Contrastive Language-Image Pretraining ☆143 · Updated 2 years ago
- ☆24 · Updated 5 years ago
- 🎢 Creating and sharing simulation environments for embodied and synthetic data research ☆190 · Updated last year
- Experiment. Plot. Tabulate. ☆68 · Updated 2 months ago
- PyTorch Package For Quasimetric Learning ☆42 · Updated last week
- 🧀 Pytorch code for the Fromage optimiser. ☆122 · Updated 3 months ago
- NEVIS'22: Benchmarking the next generation of never-ending learners ☆98 · Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆185 · Updated 2 years ago
- An open source implementation of CLIP. ☆32 · Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆104 · Updated 2 years ago
- [NeurIPS 2022] DataMUX: Data Multiplexing for Neural Networks ☆59 · Updated last year
- Train very large language models in Jax. ☆195 · Updated last year
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch. ☆56 · Updated 2 years ago
- ☆17 · Updated last year
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 · Updated 2 years ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆124 · Updated 2 years ago
- ☆64 · Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021) ☆47 · Updated last year