Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.
☆10Aug 13, 2023Updated 2 years ago
Alternatives and similar repositories for code-mixed-lid
Users that are interested in code-mixed-lid are comparing it to the libraries listed below
Sorting:
- Language Identification and transliteration tool for Indian language code mixed data.☆24Feb 29, 2016Updated 10 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆58Jul 30, 2024Updated last year
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆58Aug 11, 2020Updated 5 years ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 4 years ago
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)☆10May 1, 2025Updated 10 months ago
- Transfering images python from server to client UI python socket☆13Sep 30, 2020Updated 5 years ago
- ☆12Nov 7, 2024Updated last year
- ☆24May 6, 2025Updated 10 months ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Crawling indonesia wordlist☆14Feb 9, 2021Updated 5 years ago
- Source code of phaazon.net.☆11Sep 17, 2024Updated last year
- This repository is dedicated to development of code-mixed language resources.☆27Jul 22, 2023Updated 2 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- 基于Paddle进行语义检索并部署上线,支持多语言 This code is based on Paddle to do a semantic search, and deploy it. Multilingual support☆13Aug 11, 2022Updated 3 years ago
- It's a Google Assistant action for PDPU that tells you about which classes and labs you have today.☆11Mar 24, 2019Updated 6 years ago
- MLP implementation in Python with PyTorch for the MNIST-fashion dataset (90+ on test)☆11Dec 24, 2021Updated 4 years ago
- CalBERT - Code-mixed Adaptive Language representations using BERT, published at AAAI-MAKE 2022☆13Dec 18, 2023Updated 2 years ago
- ☆13Jun 14, 2024Updated last year
- Hinglish Text Classification☆30Jun 12, 2023Updated 2 years ago
- SemEval 2019 Task 4: Hyperpartisan News Detection☆10Nov 9, 2019Updated 6 years ago
- This is the repository for my version of Kaldi for Dummies example.☆17Nov 18, 2018Updated 7 years ago
- ☆33Jun 20, 2018Updated 7 years ago
- English, Hausa, Igbo and Yoruba corpora and results (presented in excel files) of word-level language identification research using the c…☆16Oct 15, 2018Updated 7 years ago
- The thing that connects your pipes to your colon☆12Jul 10, 2024Updated last year
- Data Bahasa Indonesia☆18Sep 25, 2017Updated 8 years ago
- ☆16Jul 23, 2021Updated 4 years ago
- Code Repository for the IndicXNLI paper.☆15Jul 8, 2023Updated 2 years ago
- ☆17Mar 14, 2026Updated last week
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- A text-object for LHS/RHS of assignment.☆16Oct 31, 2020Updated 5 years ago
- My configuration files☆16Feb 25, 2026Updated 3 weeks ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- ☆11Jun 23, 2022Updated 3 years ago
- Exact Cover Sudoku Solver☆19Dec 12, 2022Updated 3 years ago
- Towards Few-Shot Fact-Checking via Perplexity☆14Jun 11, 2021Updated 4 years ago
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- ☆12Jan 21, 2019Updated 7 years ago
- Fast and accurate systemic data extraction with LLM assistance☆42Mar 1, 2026Updated 3 weeks ago
- Automatic speech recognition (ASR) for Indonesian language built by using HTK and Julius. Web interface is built using Node.js.☆21Dec 16, 2016Updated 9 years ago