WuDaoMM: a data project
☆74 · Apr 29, 2022 · Updated 3 years ago
Alternatives and similar repositories for WuDaoMM
Users that are interested in WuDaoMM are comparing it to the libraries listed below
- “WuDao” source code ☆21 · Aug 24, 2021 · Updated 4 years ago
- [IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official PyTorch implementation) ☆21 · Aug 31, 2022 · Updated 3 years ago
- Text-to-Image generation ☆35 · Aug 10, 2021 · Updated 4 years ago
- LGEB: Benchmark of Language Generation Evaluation ☆16 · Oct 21, 2022 · Updated 3 years ago
- The code of "HAM: Hidden Anchor Mechanism for Scene Text Detection" ☆11 · Sep 22, 2020 · Updated 5 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆34 · Sep 16, 2023 · Updated 2 years ago
- Source code and checkpoints for legal pre-trained language models. ☆15 · May 9, 2021 · Updated 4 years ago
- Code for EMNLP 2022 paper “Distilled Dual-Encoder Model for Vision-Language Understanding” ☆31 · May 1, 2023 · Updated 2 years ago
- Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment" ☆31 · Nov 24, 2021 · Updated 4 years ago
- Subjective Image Captioning using Capsule Generative Adversarial Network ☆11 · Jun 28, 2021 · Updated 4 years ago
- ☆18 · Jan 17, 2023 · Updated 3 years ago
- Variable-length handwritten digit string recognition based on Faster R-CNN ☆16 · Oct 29, 2019 · Updated 6 years ago
- CVPR 2021 Official PyTorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training ☆34 · Nov 9, 2021 · Updated 4 years ago
- [ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds ☆43 · Jul 6, 2022 · Updated 3 years ago
- ☆65 · Dec 15, 2023 · Updated 2 years ago
- Bridging Vision and Language Model ☆286 · Mar 27, 2023 · Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa… ☆76 · Feb 21, 2022 · Updated 4 years ago
- Code for EMNLP2021 paper "Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training" ☆20 · Nov 12, 2021 · Updated 4 years ago
- [ECCV 2024] Official code for "Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation" ☆18 · Jul 31, 2025 · Updated 7 months ago
- [ICML2024] Adaptive decoding balances the diversity and coherence of open-ended text generation. ☆19 · Jun 2, 2024 · Updated last year
- The code of Paper "Locate Then Ask: Interpretable Stepwise Reasoning for Multi-hop Question Answering". ☆22 · Sep 1, 2022 · Updated 3 years ago
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017 ☆17 · Jun 17, 2017 · Updated 8 years ago
- Product1M ☆90 · Oct 12, 2022 · Updated 3 years ago
- Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021 ☆58 · Mar 31, 2022 · Updated 3 years ago
- PIC Challenge Baseline ☆18 · Dec 27, 2018 · Updated 7 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆22 · Jan 25, 2023 · Updated 3 years ago
- (AAAI2024) Regulating Intermediate 3D Features for Vision-Centric Autonomous Driving ☆21 · Dec 20, 2023 · Updated 2 years ago
- ☆28 · Sep 1, 2021 · Updated 4 years ago
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization". ☆24 · Apr 22, 2020 · Updated 5 years ago
- Team NJU-LAMDA Code For ChaLearn LAP. ☆19 · Apr 2, 2017 · Updated 8 years ago
- Multi-Label Learning from Single Positive Labels - CVPR 2021 ☆97 · Nov 21, 2023 · Updated 2 years ago
- Finetune CPM-1 ☆24 · Jun 20, 2021 · Updated 4 years ago
- ☆59 · Sep 23, 2022 · Updated 3 years ago
- The codebase for "Group-wise Contrastive Learning for Neural Dialogue Generation" (Cai et al., Findings of EMNLP 2020) ☆55 · Feb 24, 2021 · Updated 5 years ago
- ☆110 · Dec 23, 2022 · Updated 3 years ago
- ☆63 · May 17, 2023 · Updated 2 years ago
- ☆57 · Jan 23, 2024 · Updated 2 years ago
- COVID-19 Related NLP Papers ☆30 · Jan 20, 2022 · Updated 4 years ago
- EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering ☆68 · Nov 26, 2021 · Updated 4 years ago