omerarshad/MultiModalNER


Code for paper "Aiding Intra-Text Representations with Visual Context for Multimodal Named Entity Recognition"

☆16

Alternatives and similar repositories for MultiModalNER

Users that are interested in MultiModalNER are comparing it to the libraries listed below.

thecharm / AGBAN
Code for IEEE Trans. on Multimedia (TMM) paper "Object-aware Multimodal Named Entity Recognition in Social Media Posts with Adversarial L…
☆20 · Mar 3, 2021 · Updated 5 years ago
SenticNet / multimodal-fusion
Attention-based multimodal fusion for sentiment analysis
☆13 · Aug 14, 2018 · Updated 7 years ago
gdufsnlp / SWAFN
Code for Paper "SWAFN: Sentimental Words Aware Fusion Network for Multimodal Sentiment Analysis", COLING2020
☆13 · Oct 6, 2023 · Updated 2 years ago
monologg / NER-Multimodal-pytorch
Pytorch Implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
☆58 · Oct 3, 2023 · Updated 2 years ago
cubenlp / ACL19_Scaling_Up_Open_Tagging
ACL19-Scaling Up Open Tagging from Tens to Thousands
☆17 · Aug 23, 2019 · Updated 6 years ago
CatOneTwo / CoDTS
(2025 AAAI) CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework
☆16 · Jun 5, 2026 · Updated last month
YangXiaocui1215 / MVAN
☆16 · Mar 30, 2021 · Updated 5 years ago
jefferyYu / UMT
Preprocessed Datasets for our Multimodal NER paper
☆125 · Dec 17, 2022 · Updated 3 years ago
jd-aig / JAVE
☆88 · Sep 15, 2020 · Updated 5 years ago
hammoudhasan / DiversitySSL
Original code base for On Pretraining Data Diversity for Self-Supervised Learning
☆14 · Dec 30, 2024 · Updated last year
tuituidan / image-host
An image host built with springboot + minio + elasticsearch + webuploader; supports tagging images, searching with elasticsearch, image compression, chunked upload, instant upload, and resumable upload
☆19 · Oct 3, 2024 · Updated last year
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 3 months ago
pliang279 / factorized
[ICLR 2019] Learning Factorized Multimodal Representations
☆69 · Aug 4, 2020 · Updated 5 years ago
wutong8023 / SpeechRE
☆11 · Nov 11, 2022 · Updated 3 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
Sjyzheishuai / Neo4j-visualization
The main function of this project is neo4j visualization and query node
☆15 · Sep 29, 2022 · Updated 3 years ago
ewwink / wikipedia-wordlists-extractor
Extract Unique Word Lists From Wikipedia Database
☆13 · May 27, 2020 · Updated 6 years ago
XL2248 / SOV-MAS
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11 · May 16, 2023 · Updated 3 years ago
MengboLi / MS-SENet
☆11 · Jul 16, 2024 · Updated 2 years ago
Multimodal-NER / RpBERT
RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER
☆76 · Mar 31, 2023 · Updated 3 years ago
yuntaoshou / CBERL
☆18 · Mar 21, 2024 · Updated 2 years ago
mengshiY / RCSF
Code for paper "Cross-Domain Slot Filling as Machine Reading Comprehension" in IJCAI 2021
☆11 · Aug 24, 2021 · Updated 4 years ago
Serega6678 / NuNER
NuNER is the family of SOTA Foundation and Zero-shot for Entity Recognition
☆15 · Jun 11, 2024 · Updated 2 years ago
yangjingyuan / ConstDecoder
☆11 · Oct 24, 2022 · Updated 3 years ago
may- / joeys2t
Minimalist Speech-to-Text toolkit for educational purposes
☆13 · Feb 1, 2024 · Updated 2 years ago
alexpovel / betterletter
Substitute alternative spellings of special characters (e.g. German umlauts [ae, oe, ue] and [ss]) with their correct versions (ä, ö, ü, …
☆11 · Nov 24, 2024 · Updated last year
dksanyal / SpERT.PL
Joint Neural Model for Entity & Relation Extraction
☆16 · Oct 18, 2021 · Updated 4 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19 · Jul 16, 2024 · Updated 2 years ago
gaguilar / NER-WNUT17
The implementation of the paper "A Multi-task Approach for Named Entity Recognition on Social Media Data," which won the WNUT-2017 Shared…
☆67 · Oct 21, 2017 · Updated 8 years ago
DianboWork / M3T-CNERTA
☆11 · Aug 10, 2022 · Updated 3 years ago
Speech-Lab-IITM / data2vec-aqc
Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…
☆13 · Mar 18, 2024 · Updated 2 years ago
guangkun0818 / speech2text
Speech understanding system training toolkit, including tasks of ASR, SSL, LM, etc.
☆12 · Feb 12, 2026 · Updated 5 months ago
asindel / SliTraNet
Source code to "SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks"
☆10 · Dec 17, 2023 · Updated 2 years ago
CoraJung / flexible-input-slu
This setup allows to train end-to-end neural models for spoken language understanding (SLU).
☆11 · Jun 12, 2023 · Updated 3 years ago
vagos / llm-clap
Generate embeddings for audio files (music, speech, sounds) and text using CLAP with llm
☆22 · May 15, 2025 · Updated last year
s920128 / NAR-BERT-ASR
NAR-BERT-ASR
☆10 · Sep 27, 2021 · Updated 4 years ago
AILab-UniFI / cte-dataset
CTE: Contextualized Table Extraction Dataset
☆17 · Feb 23, 2023 · Updated 3 years ago
freebase-schema / freebase
☆23 · Apr 24, 2013 · Updated 13 years ago
W-Wu / DEER
☆12 · Aug 25, 2023 · Updated 2 years ago