artelab / Image-and-Text-fusion-for-UPMC-Food-101-using-BERT-and-CNNs
☆64 · Jun 25, 2021 · Updated 4 years ago
Alternatives and similar repositories for Image-and-Text-fusion-for-UPMC-Food-101-using-BERT-and-CNNs
Users who are interested in Image-and-Text-fusion-for-UPMC-Food-101-using-BERT-and-CNNs are comparing it to the repositories listed below.
- ☆11 · May 18, 2022 · Updated 3 years ago
- Classify image and text with ResNet and BERT models using PyTorch ☆13 · Jul 7, 2020 · Updated 5 years ago
- Baseline model for multimodal classification based on images and text. Text representation obtained from pretrained BERT base model and i… ☆42 · Aug 26, 2022 · Updated 3 years ago
- ☆34 · Mar 11, 2022 · Updated 3 years ago
- The 1st place solution for SIGIR 2020 E-Commerce Workshop Multimodal Product Classification Challenge ☆21 · Aug 3, 2020 · Updated 5 years ago
- Data and Baselines for AStitchInLanguageModels dataset ☆12 · Oct 31, 2022 · Updated 3 years ago
- Code & Data for our Paper "PATTERN-BASED CHINESE HYPERNYM-HYPONYM RELATION EXTRACTION METHOD" ☆12 · Jan 29, 2020 · Updated 6 years ago
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel… ☆15 · Dec 10, 2022 · Updated 3 years ago
- Facebook Hateful Memes Challenge ☆12 · Jan 28, 2021 · Updated 5 years ago
- Fine-Grained Visual Classification via Simultaneously Learning of Multi-regional Multi-grained Features ☆12 · Mar 2, 2021 · Updated 4 years ago
- ☆15 · Jan 16, 2024 · Updated 2 years ago
- Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv… ☆15 · Mar 25, 2023 · Updated 2 years ago
- [ICLR 23] Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning ☆40 · Jul 29, 2023 · Updated 2 years ago
- Source code for training Gated Multimodal Units on MM-IMDb dataset ☆100 · Apr 8, 2023 · Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks ☆16 · Dec 9, 2022 · Updated 3 years ago
- A BERT-Based Transfer Learning Approach for Hate Speech Detection in Online Social Media ☆43 · Jan 28, 2022 · Updated 4 years ago
- Training Food-101 (achieved SOTA top-1 validation accuracy ≈ 90%) using the 1-cycle policy ☆15 · Aug 24, 2019 · Updated 6 years ago
- [NeurIPS'20-Competition] Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Meme… ☆61 · Feb 12, 2024 · Updated 2 years ago
- MPCA: Multilinear Principal Component Analysis of Tensor Data ☆17 · Feb 10, 2018 · Updated 8 years ago
- MultiSentiNet-CIKM2017 ☆22 · Jan 9, 2018 · Updated 8 years ago
- Official code for the NeurIPS 2023 paper "Learning Unseen Modality Interaction" ☆18 · Jan 22, 2024 · Updated 2 years ago
- Bimodal and Unimodal Sentiment Analysis of Internet Memes (Image+Text) ☆16 · Oct 3, 2021 · Updated 4 years ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge. ☆26 · Aug 9, 2022 · Updated 3 years ago
- Code for the paper "OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision" ☆30 · May 9, 2022 · Updated 3 years ago
- Specific correspondence analysis in R ☆14 · Aug 25, 2025 · Updated 5 months ago
- Implementation of the paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model" ☆16 · Apr 18, 2023 · Updated 2 years ago
- Multimodal short video classification task, integrating video, image, audio and text modalities for short video classification ☆19 · Mar 12, 2020 · Updated 5 years ago
- ☆21 · Oct 4, 2022 · Updated 3 years ago
- ☆24 · Jun 28, 2023 · Updated 2 years ago
- A PyTorch implementation of the paper "Multimodal Transformer with Multiview Visual Representation for Image Captioning" ☆25 · Sep 4, 2020 · Updated 5 years ago
- ☆93 · Dec 14, 2022 · Updated 3 years ago
- An implementation of a full two-step recommendation pipeline applied on the Kaggle H&M data ☆24 · Oct 17, 2022 · Updated 3 years ago
- ☆30 · May 27, 2023 · Updated 2 years ago
- A simple Python reproduction and modification of the 2022 Ig Nobel Prize for Economics "Which Is More Important: Talent or Luck?" ☆28 · Apr 17, 2023 · Updated 2 years ago
- A Python package of common operations for AMRs ☆29 · Jun 7, 2022 · Updated 3 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition ☆31 · Dec 4, 2020 · Updated 5 years ago
- ☆10 · Feb 13, 2024 · Updated 2 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset ☆35 · Jan 31, 2018 · Updated 8 years ago
- PyTorch implementation for Learning with Twin Noisy Labels for Visible-Infrared Person Re-Identification (CVPR 2022). ☆32 · Feb 1, 2024 · Updated 2 years ago