gentaiscool / code-switching-papers
A curated list of research papers and resources on code-switching
☆298Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for code-switching-papers
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- Tracking the progress in end-to-end speech translation☆254Updated last year
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- ☆175Updated 3 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆114Updated last month
- A tool that locates, downloads, and extracts machine translation corpora☆147Updated 5 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆154Updated 3 months ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆112Updated 5 years ago
- Zero -- A neural machine translation system☆149Updated last year
- ☆359Updated 2 years ago
- cLang-8 is a dataset for grammatical error correction.☆104Updated 2 years ago
- Automatic Mapping of Disfluency Annotations for corrected version of Switchboard☆17Updated 5 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆336Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆351Updated last year
- A guide to building language technology in new languages.☆57Updated 2 years ago
- A Neural Machine Translation toolkit for research purpose☆82Updated this week
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆440Updated 7 months ago
- Utilities for Processing the Switchboard Dialogue Act Corpus☆67Updated 3 years ago
- A tool for holistic analysis of language generations systems☆467Updated 2 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆70Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆50Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- CMU Wilderness Multilingual Speech Dataset☆272Updated 5 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus☆52Updated 4 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆52Updated 3 months ago
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆49Updated 4 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆15Updated 4 years ago
- Easier Automatic Sentence Simplification Evaluation☆159Updated last year