kingoflolz / CLIP_JAX
Contrastive Language-Image Pretraining
☆141Updated 2 years ago
Alternatives and similar repositories for CLIP_JAX:
Users that are interested in CLIP_JAX are comparing it to the libraries listed below
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- v objective diffusion inference code for JAX.☆213Updated 2 years ago
- ☆88Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- Train vision models using JAX and 🤗 transformers☆96Updated last month
- JAX implementation ViT-VQGAN☆82Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆80Updated 3 years ago
- Simple python template☆40Updated 10 months ago
- gpu tester detects broken and slow gpus in a cluster☆68Updated 2 years ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.☆249Updated last year
- ☆57Updated 2 years ago
- CLOOB Conditioned Latent Diffusion training and inference code☆112Updated 2 years ago
- ☆64Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆187Updated 2 years ago
- ☆151Updated last year
- LoRA for arbitrary JAX models and functions☆135Updated last year
- Latent Diffusion Language Models☆68Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated last year
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax☆98Updated 2 years ago
- ☆27Updated 4 years ago
- Automatically take good care of your preemptible TPUs☆36Updated last year
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me☆37Updated 2 years ago
- ☆58Updated 3 years ago
- Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.☆60Updated 2 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆147Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 7 months ago