Code for paper "Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition"
☆16Aug 19, 2019Updated 6 years ago
Alternatives and similar repositories for MultiModalNER
Users that are interested in MultiModalNER are comparing it to the libraries listed below
Sorting:
- Implementation of Adaptive Co-attention Network for Named Entity Recognition in Tweets in AAAI2018.☆71Mar 9, 2018Updated 8 years ago
- Code recipe for "Multimodal One-Shot Learning of Speech and Images"☆11Nov 21, 2018Updated 7 years ago
- Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…☆20Mar 3, 2021Updated 5 years ago
- Attention-based multimodal fusion for sentiment analysis☆13Aug 14, 2018Updated 7 years ago
- Pytorch Implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)☆58Oct 3, 2023Updated 2 years ago
- 基于多模态的属性抽取☆45Aug 6, 2020Updated 5 years ago
- Code for Paper "SWAFN: Sentimental Words Aware Fusion Network for Multimodal Sentiment Analysis", COLING2020☆13Oct 6, 2023Updated 2 years ago
- ACL19-Scaling Up Open Tagging from Tens to Thousands☆17Aug 23, 2019Updated 6 years ago
- ☆17Mar 30, 2021Updated 4 years ago
- PyTorch implementation for the paper "The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG…☆18Sep 18, 2025Updated 6 months ago
- Preprocessed Datasets for our Multimodal NER paper☆123Dec 17, 2022Updated 3 years ago
- Repository for the ACL'22 paper "So Different Yet So Alike! Constrained Unsupervised Text Style Transfer"☆16Jan 19, 2024Updated 2 years ago
- ☆88Sep 15, 2020Updated 5 years ago
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Dec 30, 2024Updated last year
- ☆10Oct 16, 2025Updated 5 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- [NeurIPS 2022 Workshop] A Case Study with Negated Prompts using T0 (3B, 11B), InstructGPT (350M-175B), GPT-3 (350M - 175B) & OPT (125M - …☆24Sep 27, 2022Updated 3 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆26Sep 24, 2025Updated 5 months ago
- 使用springboot+minio+elasticsearch+webuploader实现图床,支持给图片打标签,使用elasticsearch搜索,支持图片压缩,支持分片上传,秒传,断点续传☆19Oct 3, 2024Updated last year
- RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER☆76Mar 31, 2023Updated 2 years ago
- NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition☆14Jun 11, 2024Updated last year
- Code for the paper "Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification" - ICDM 2019☆13Mar 25, 2023Updated 2 years ago
- ☆17Mar 21, 2024Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Minimalist Speech-to-Text toolkit for educational purposes☆13Feb 1, 2024Updated 2 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- 🌮 Table-based KB Completer☆16Mar 13, 2024Updated 2 years ago
- ☆11Aug 10, 2022Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Generate embeddings for audio files (music, speech, sounds) and text using CLAP with llm☆20May 15, 2025Updated 10 months ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.☆11Feb 12, 2026Updated last month
- CTE: Contextualized Table Extraction Dataset☆17Feb 23, 2023Updated 3 years ago
- EACL 2023 paper "MLASK: Multimodal Summarization of Video-based News Articles"☆12Nov 7, 2023Updated 2 years ago
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- ☆23Apr 24, 2013Updated 12 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago