amsuhane / ACL20-Code-switching-patterns
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
☆10Updated 3 years ago
Alternatives and similar repositories for ACL20-Code-switching-patterns:
Users that are interested in ACL20-Code-switching-patterns are comparing it to the libraries listed below
- Repository for the English-Hindi Codemixed to Monolingual English Parallel Corpus☆13Updated 6 years ago
- ☆33Updated 6 years ago
- A benchmark for code-switched NLP, ACL 2020☆74Updated 9 months ago
- Codes for the paper "Towards Sub-Word Level Compositions for Sentiment Analysis of Hi-En Code Mixed Text "☆35Updated 8 years ago
- Codebase for probing and visualizing multilingual models.☆47Updated 4 years ago
- Material for the COLING 2020 Tutorial on Multilingual NMT☆16Updated 4 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆30Updated 3 years ago
- A retrieve and edit approach to generate sarcasm by reversing valence and adding incongruent common sense context☆32Updated 3 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆12Updated 6 years ago
- LongSumm - Scientific Document Summarization Task☆74Updated 2 years ago
- Dataset of ML and NLP papers☆35Updated 2 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Updated 5 years ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Updated 3 years ago
- Interactive Neural Machine Translation tool☆53Updated last year
- NAACL 2019: Submodular optimization-based diverse paraphrasing and its effectiveness in data augmentation☆69Updated last year
- A collection of English tweets annotated in Universal Dependencies.☆39Updated 3 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆116Updated 3 years ago
- ☆11Updated 6 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018)☆81Updated 3 years ago
- ☆32Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆52Updated 4 years ago
- Dataset of sentences from Hindi stories tagged with different emotion tags☆10Updated 5 years ago
- ☆20Updated 2 years ago
- Extracting useful metadata from Wikipedia dumps in any language.☆26Updated 5 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated last year
- Geometry-aware Multilingual Embeddings☆26Updated 2 years ago
- Language Identification and transliteration tool for Indian language code mixed data.☆23Updated 9 years ago
- A dataset of atomic wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai…☆106Updated 5 years ago
- Formate converter from one type of qa task datasets to another type☆39Updated 6 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆55Updated 3 months ago