mttn2023 / mttn
MTTN: Multi-Pair Text to Text Narratives for Prompt Generation
☆11Updated last year
Related projects
Alternatives and complementary repositories for mttn
- Guide diffusion on ImageBind embedding similarity☆28Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Describe the format of image/text datasets☆11Updated 2 years ago
- Load any CLIP model with a standardized interface☆21Updated 7 months ago
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Official repository for the paper "Images as Weight Matrices: Sequential Image Generation Through Synaptic Learning Rules" (ICLR 2023)☆12Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 2 years ago
- Directed masked autoencoders☆14Updated last year
- Aggregating embeddings over time☆31Updated last year
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 3 years ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Code for the paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆19Updated 3 months ago
- I have created a dataset of image-text pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆15Updated 3 years ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆30Updated 4 months ago
- A fast approach for translating a series of text prompts into a video. The 2022 NeurIPS Workshop on Machine Learning for Creativity and D…☆32Updated last year
- ViT trained on COYO-Labeled-300M dataset☆29Updated 2 years ago