mhh0318 / UniD3
☆55 Feb 9, 2023 Updated 3 years ago
Alternatives and similar repositories for UniD3
Users who are interested in UniD3 are comparing it to the repositories listed below
- ☆25 Nov 30, 2023 Updated 2 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 Sep 17, 2021 Updated 4 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models ☆31 Nov 1, 2024 Updated last year
- Bag of MLP ☆20 May 31, 2021 Updated 4 years ago
- ☆27 Feb 9, 2023 Updated 3 years ago
- ☆16 Jul 7, 2023 Updated 2 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. ☆21 Jul 13, 2022 Updated 3 years ago
- ☆64 Jun 2, 2023 Updated 2 years ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection" ☆27 Jul 10, 2023 Updated 2 years ago
- ☆285 Aug 14, 2025 Updated 6 months ago
- Funny Application of Neural Head Reenactment to Naver Webtoon ☆10 Mar 22, 2021 Updated 4 years ago
- ☆12 Dec 9, 2025 Updated 2 months ago
- Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms ☆11 Nov 29, 2021 Updated 4 years ago
- Edit and Generate Anything in 3D world! ☆13 Apr 15, 2023 Updated 2 years ago
- Official repository for Fourier model that can generate periodic signals ☆10 Mar 10, 2022 Updated 3 years ago
- A paper list about diffusion models for natural language processing. ☆183 Aug 28, 2023 Updated 2 years ago
- Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023) ☆162 May 18, 2023 Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa… ☆76 Feb 21, 2022 Updated 3 years ago
- Code for the ECCV 2022 paper "Unleashing Transformers" ☆185 Apr 17, 2023 Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch ☆11 Nov 17, 2022 Updated 3 years ago
- Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks ☆13 Jun 12, 2023 Updated 2 years ago
- PyTorch implementation of the Region Mutual Information Loss for Semantic Segmentation. ☆26 Oct 26, 2023 Updated 2 years ago
- ☆31 Jun 29, 2022 Updated 3 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers ☆29 Nov 14, 2022 Updated 3 years ago
- ☆85 Dec 4, 2022 Updated 3 years ago
- The Official PyTorch Implementation of OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation ☆34 Jul 6, 2024 Updated last year
- A practice for million-scale multi-domain universal object detection ☆28 Jun 13, 2024 Updated last year
- VaLM: Visually-augmented Language Modeling. ICLR 2023. ☆56 Mar 6, 2023 Updated 2 years ago
- Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models ☆357 Jul 4, 2023 Updated 2 years ago
- Official Code of ECCV 2022 paper MS-CLIP ☆91 Jul 27, 2022 Updated 3 years ago
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI … ☆14 Mar 1, 2025 Updated 11 months ago
- ☆17 Oct 18, 2022 Updated 3 years ago
- StyleSwin: Transformer-based GAN for High-resolution Image Generation ☆11 Dec 21, 2021 Updated 4 years ago
- [ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations ☆15 Oct 28, 2023 Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh… ☆63 Jan 18, 2023 Updated 3 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching". ☆14 Sep 1, 2022 Updated 3 years ago
- ☆58 Nov 13, 2024 Updated last year
- Exploring the Transfer Learning Capabilities of CLIP in Domain Generalization for Diabetic Retinopathy ☆16 Sep 21, 2023 Updated 2 years ago
- Solution to Kaggle Santa 2021 Challenge ☆14 Jan 18, 2022 Updated 4 years ago