ytaek-oh / retriever
☆11Updated last year
Alternatives and similar repositories for retriever:
Users that are interested in retriever are comparing it to the libraries listed below
- ☆50Updated 2 years ago
- ☆59Updated last year
- [CVPR 2024 Highlight] ImageNet-D☆41Updated 5 months ago
- Turning to Video for Transcript Sorting☆48Updated last year
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Updated 3 years ago
- ☆46Updated 11 months ago
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆41Updated 2 years ago
- [ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization☆31Updated last year
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆27Updated 2 years ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆83Updated last month
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated last year
- (NeurIPS 2019) Combinatorial Inference against Label Noise☆11Updated 9 months ago
- [NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection☆20Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11Updated last year
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆107Updated last year
- ☆50Updated 11 months ago
- Official PyTorch implementation of "Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis…☆44Updated last year
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆28Updated 4 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆24Updated 4 months ago
- ☆23Updated last year
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 3 years ago
- (arXiv.2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives☆36Updated 5 months ago
- ☆57Updated 11 months ago
- ☆43Updated last year
- [NeurIPS'22] ReCo: Retrieve and Co-segment for Zero-shot Transfer☆62Updated last year
- Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022☆35Updated last year
- ☆26Updated last year
- ☆45Updated 3 weeks ago