mttn2023 / mttn
MTTN: Multi-Pair Text to Text Narratives for Prompt Generation
☆11Updated 2 years ago
Alternatives and similar repositories for mttn:
Users that are interested in mttn are comparing it to the libraries listed below
- A dashboard for exploring timm learning rate schedulers☆19Updated 4 months ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Official repository for MaGNET, ICLR 2022☆24Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆15Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- ☆28Updated 8 months ago
- Includes additional materials for the following keras.io blog post.☆12Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- ☆29Updated 2 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them.☆32Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- ☆32Updated 5 months ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- Load any clip model with a standardized interface☆21Updated 11 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 11 months ago
- Implementation of Metaformer, but in an autoregressive manner☆23Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Fine-tune of Florence-2 for shot categorization.☆22Updated 3 weeks ago
- We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th…☆16Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- ☆12Updated 7 months ago
- A tool for benchmarking image generation models.☆31Updated 2 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 10 months ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆39Updated 3 years ago