CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching
☆18Mar 29, 2021Updated 4 years ago
Alternatives and similar repositories for CodemixedNLP
Users that are interested in CodemixedNLP are comparing it to the libraries listed below
Sorting:
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.☆10Aug 13, 2023Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 2 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- A curated list of research papers and resources on code-switching☆333Jan 31, 2026Updated last month
- ☆11May 9, 2022Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆58Aug 11, 2020Updated 5 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆16Aug 11, 2023Updated 2 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 10 months ago
- Multilingual Open Text☆25May 8, 2025Updated 10 months ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆58Jul 30, 2024Updated last year
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Feb 16, 2021Updated 5 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018)☆81Dec 28, 2021Updated 4 years ago
- The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)☆53Jun 12, 2022Updated 3 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- ☆20Dec 16, 2020Updated 5 years ago
- Companion Repo for the book The Applied ML Field Manual, Prithiviraj Damodaran☆12Jun 22, 2022Updated 3 years ago
- SemEval 2019 Task 4: Hyperpartisan News Detection☆10Nov 9, 2019Updated 6 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆17Oct 24, 2020Updated 5 years ago
- ☆19Mar 12, 2025Updated last year
- Micro-framework for publishing linked data☆11Aug 1, 2017Updated 8 years ago
- A benchmark for code-switched NLP, ACL 2020☆76May 28, 2024Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Sep 15, 2021Updated 4 years ago
- Formulaire en ligne qui génère une attestation de déplacement dérogatoire☆10Mar 18, 2020Updated 6 years ago
- Code and experiments for the COLING2020 paper "Conception: Multilingually-Enhanced, Human-Readable Concept Vector Representations".☆11Dec 9, 2020Updated 5 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Dec 23, 2019Updated 6 years ago
- Language Modelling, CMI vs Perplexity☆11Mar 17, 2018Updated 8 years ago
- BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText☆10Sep 3, 2019Updated 6 years ago
- Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022☆20Mar 18, 2022Updated 4 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- ☆10Jul 17, 2015Updated 10 years ago
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- AdaptKeyBERT: keyword/keyphrase extraction with zero-shot and few-shot semi-supervised domain adaptation☆25Sep 22, 2024Updated last year
- Rule-based Kurdish Transliterator☆10May 3, 2024Updated last year
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆15Jan 31, 2023Updated 3 years ago
- WProofreader software development kit (SDK) offers multilingual spelling & grammar check API and JavaScript libraries for rich text edito…☆13Feb 20, 2026Updated last month
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- ☆14Aug 3, 2022Updated 3 years ago