☆55Feb 9, 2023Updated 3 years ago
Alternatives and similar repositories for UniD3
Users that are interested in UniD3 are comparing it to the libraries listed below
Sorting:
- ☆25Nov 30, 2023Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆31Nov 1, 2024Updated last year
- Bag of MLP☆20May 31, 2021Updated 4 years ago
- ☆27Feb 9, 2023Updated 3 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- ☆16Jul 7, 2023Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Jul 10, 2023Updated 2 years ago
- ☆65Jun 2, 2023Updated 2 years ago
- ☆285Aug 14, 2025Updated 6 months ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms☆11Nov 29, 2021Updated 4 years ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 3 years ago
- Funny Application of Neural Head Reenactment to Naver Webtoon☆10Mar 22, 2021Updated 4 years ago
- ☆12Dec 9, 2025Updated 2 months ago
- A paper list about diffusion models for natural language processing.☆183Aug 28, 2023Updated 2 years ago
- Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023)☆162May 18, 2023Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Feb 21, 2022Updated 4 years ago
- Code for the ECCV 2022 paper "Unleashing Transformers"☆185Apr 17, 2023Updated 2 years ago
- ☆13Nov 20, 2023Updated 2 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks☆13Jun 12, 2023Updated 2 years ago
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- PyTorch implementation of the Region Mutual Information Loss for Semantic Segmentation.☆26Oct 26, 2023Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- ☆31Jun 29, 2022Updated 3 years ago
- ☆85Dec 4, 2022Updated 3 years ago
- ☆50Nov 10, 2023Updated 2 years ago
- ☆17Mar 22, 2025Updated 11 months ago
- The Official PyTorch Implementation of OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation☆34Jul 6, 2024Updated last year
- A practice for million-scale multi-domain universal object detection☆28Jun 13, 2024Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models☆357Jul 4, 2023Updated 2 years ago
- ☆28Apr 28, 2023Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- StyleSwin: Transformer-based GAN for High-resolution Image Generation☆11Dec 21, 2021Updated 4 years ago
- [ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations☆15Oct 28, 2023Updated 2 years ago
- ☆17Oct 18, 2022Updated 3 years ago
- ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis☆126Mar 14, 2022Updated 3 years ago