soumik12345 / clip-lightning
☆19Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for clip-lightning
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆97Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated last month
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Load any clip model with a standardized interface☆21Updated 6 months ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts☆108Updated last month
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆84Updated 2 years ago
- Contrastive Language-Audio Pretraining☆87Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Aggregating embeddings over time☆31Updated last year
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 2 years ago
- Official implementation of "Active Image Indexing"☆58Updated last year
- ☆73Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆95Updated last year
- Includes additional materials for the following keras.io blog post.☆12Updated 3 years ago
- Video descriptions of research papers relating to foundation models and scaling☆30Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆109Updated last year
- ☆28Updated 2 years ago
- ☆11Updated 2 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆95Updated 2 years ago
- Implementation of Zorro, Masked Multimodal Transformer, in Pytorch☆95Updated last year
- Implementation of Multistream Transformers in Pytorch☆53Updated 3 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆19Updated 3 months ago
- Utilities for PyTorch distributed☆23Updated last year
- ☆44Updated 3 years ago