delyan-boychev / imaginetLinks
☆10Updated 3 months ago
Alternatives and similar repositories for imaginet
Users that are interested in imaginet are comparing it to the libraries listed below
Sorting:
- research work on multimodal cognitive ai☆63Updated last month
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆16Updated last year
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 6 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆20Updated 2 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆33Updated 2 years ago
- An official PyTorch implementation for CLIPPR☆29Updated 2 years ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- Official implementation of ECCV24 paper: POA☆24Updated 11 months ago
- The official repo of continuous speculative decoding☆27Updated 3 months ago
- Code for the paper "Manipulating Embeddings of Stable Diffusion Prompts".☆14Updated 11 months ago
- Code for T-MARS data filtering☆35Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆40Updated last year
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆46Updated 4 months ago
- Train vector quantized CLIP models using pytorch lightning☆20Updated last year
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆29Updated 2 months ago
- Fork of Flame repo for training of some new stuff in development☆14Updated last week
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- YesBut - Multimodal Satire Comprehension Dataset☆17Updated 9 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆31Updated last year
- ☆65Updated last year
- ☆28Updated 11 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆74Updated last year
- Un-*** 50 billions multimodality dataset☆23Updated 2 years ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 8 months ago
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"☆35Updated 11 months ago
- MEXMA: Token-level objectives improve sentence representations☆41Updated 6 months ago
- ☆32Updated 8 months ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆52Updated 5 months ago