rom1504 / CLIP
Contrastive Language-Image Pretraining
☆37 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for CLIP
- ☆26 · Updated last year
- ☆28 · Updated 2 years ago
- Utilities for PyTorch distributed ☆23 · Updated last year
- A JAX nn library ☆21 · Updated 8 months ago
- Implementation of LogAvgExp for Pytorch ☆32 · Updated 2 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆17 · Updated 2 weeks ago
- Load any clip model with a standardized interface ☆21 · Updated 6 months ago
- JAX implementation ViT-VQGAN ☆77 · Updated 2 years ago
- A JAX implementation of the continuous time formulation of Consistency Models ☆83 · Updated last year
- Latent Diffusion Language Models ☆67 · Updated last year
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction ☆31 · Updated 2 years ago
- FID computation in Jax/Flax. ☆24 · Updated 4 months ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 · Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆50 · Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers ☆18 · Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 · Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️ ☆52 · Updated last year
- Official repository for MaGNET, ICLR 2022 ☆26 · Updated 2 years ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch ☆50 · Updated last year
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI ☆84 · Updated 2 years ago
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch ☆97 · Updated last year
- ☆154 · Updated 2 years ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind ☆53 · Updated 2 months ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk ☆46 · Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆23 · Updated 2 years ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆76 · Updated 2 weeks ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆48 · Updated 3 months ago