amosy3 / Text2Model
☆20Updated 2 years ago
Alternatives and similar repositories for Text2Model:
Users who are interested in Text2Model are comparing it to the repositories listed below
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆16Updated 7 months ago
- An official PyTorch implementation for CLIPPR☆29Updated last year
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.☆31Updated 7 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).☆75Updated last month
- The official code for the SALMon🍣 benchmark☆43Updated 2 months ago
- Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.☆137Updated 2 years ago
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆36Updated 8 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆128Updated last year
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆79Updated 7 months ago
- Official PyTorch Implementation for the "Model Tree Heritage Recovery" paper.☆56Updated 7 months ago
- The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification" (ICASSP 2023)☆19Updated last year
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper)☆13Updated last month
- An image caption dataset of images from www.dpchallenge.com.☆12Updated 5 years ago
- TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment☆8Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆13Updated 7 months ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]☆16Updated last year
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆17Updated 2 months ago
- RG-UNIT, ACM MM 2020.☆10Updated 3 years ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆19Updated 8 months ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆14Updated 2 years ago
- Official code for SeMani (CVPR 2020 oral and Journal extension)☆23Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
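- [WIP@Oct 13] Q-Bench in Chinese (质衡-基准测试), including Chinese versions of the low-level visual question answering and low-level visual description datasets, as well as image quality assessment under Chinese prompts. We will release Q-Bench in more languages in the futu…☆20Updated last year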
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago