pytorch implementation of XMC-GAN
☆11Jun 2, 2021Updated 4 years ago
Alternatives and similar repositories for XMC-GAN
Users that are interested in XMC-GAN are comparing it to the libraries listed below
Sorting:
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago
- This is the public repo for the course HMMA238 'Software Development'☆10Apr 20, 2021Updated 4 years ago
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 2 years ago
- Modification of the original Mask/Faster R-CNN☆12Dec 13, 2020Updated 5 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- ☆11Feb 18, 2020Updated 6 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 5 years ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024)☆14Jul 14, 2025Updated 7 months ago
- ☆10Jul 11, 2022Updated 3 years ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- ☆11Aug 17, 2021Updated 4 years ago
- CVPR 2025 Accepted Papers☆24Dec 20, 2025Updated 2 months ago
- ☆12Jul 11, 2022Updated 3 years ago
- ☆11Jun 4, 2021Updated 4 years ago
- Official repository for Scone (Subject-driven Composition and Distinction Enhancement) model, designed to support multi-subject compositi…☆28Jan 14, 2026Updated last month
- Official implementation of ICCV 2025 paper - DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization☆22Jul 13, 2025Updated 7 months ago
- ☆96Feb 20, 2026Updated 2 weeks ago
- ☆48Aug 2, 2021Updated 4 years ago
- Code base for paper "Finding Structural Knowledge in Multimodal-BERT". Framework for probing and code for creating Scene Trees.☆10May 19, 2022Updated 3 years ago
- Code of ICME2024 Paper: Video Object Segmentation with Dynamic Query Modulation☆12Mar 23, 2024Updated last year
- Synthetic Faces High Quality - Text2Image (SFHQ-T2I) Dataset. 122,726 curated 1024x1024 synthetic face images☆16Oct 14, 2024Updated last year
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 9 months ago
- Centralized library for evaluation of generated images☆19Aug 7, 2023Updated 2 years ago
- ☆12Jul 21, 2025Updated 7 months ago
- A GCN based visual question generation model☆13Aug 21, 2019Updated 6 years ago
- ☆11Aug 17, 2018Updated 7 years ago
- Code and models for Molecule-Morphology Contrastive Pretraining (MoCoP)☆13Apr 12, 2024Updated last year
- ☆19Jan 10, 2026Updated last month
- FPV Drone Racing VIO competition.☆11Aug 17, 2020Updated 5 years ago
- Fine-tune BERT models to classify Arabic text by different dialects.☆17Aug 8, 2023Updated 2 years ago
- Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation☆18Feb 9, 2025Updated last year
- Textual Entailment Using Pytorch BERT pretrained model☆11Oct 17, 2022Updated 3 years ago
- The implementation of 'M3Net: Multilevel, Mixed and Multistage Attention Network for Salient Object Detection'.☆12Apr 18, 2025Updated 10 months ago
- ☆14Aug 5, 2024Updated last year
- ☆14May 10, 2021Updated 4 years ago
- A repository containing the code for the paper "Incorporating Domain Knowledge into Medical NLI using Knowledge Graphs" EMNLP 2019☆13Nov 2, 2019Updated 6 years ago
- Statistics on the space of asymmetric networks via Gromov-Wasserstein distance☆15Jun 13, 2020Updated 5 years ago
- source code for NeurIPS'24 paper "Towards Calibrated Robust Fine-Tuning of Vision-Language Models"☆14Oct 31, 2025Updated 4 months ago
- ☆21Jun 3, 2023Updated 2 years ago