lucidrains / open_clipLinks
An open source implementation of CLIP.
☆10Updated 2 years ago
Alternatives and similar repositories for open_clip
Users that are interested in open_clip are comparing it to the libraries listed below
Sorting:
- ☆28Updated 3 years ago
- JAX implementation ViT-VQGAN☆83Updated 2 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆64Updated 3 years ago
- Implementation of Transframer, Deepmind's U-net + Transformer architecture for up to 30 seconds video generation, in Pytorch☆71Updated 2 years ago
- Contrastive Language-Image Pretraining☆38Updated 11 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto☆56Updated last year
- Implementation of the video diffusion model and training scheme presented in the paper, Flexible Diffusion Modeling of Long Videos, in Py…☆85Updated 3 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆103Updated last year
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch☆89Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆88Updated 2 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 5 months ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆91Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆58Updated last year
- An open source implementation of CLIP.☆32Updated 2 years ago
- Implementation of LogAvgExp for Pytorch☆36Updated 2 months ago
- ☆160Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆30Updated 2 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated 2 years ago
- FID computation in Jax/Flax.☆27Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 10 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆72Updated 3 years ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated last year
- Un-*** 50 billions multimodality dataset☆23Updated 2 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 3 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆39Updated 3 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- Implementation of NWT, audio-to-video generation, in Pytorch☆91Updated 3 years ago