gentaiscool / code-switching-papers
A curated list of research papers and resources on code-switching
☆307Updated 2 months ago
Alternatives and similar repositories for code-switching-papers:
Users that are interested in code-switching-papers are comparing it to the libraries listed below
- Yet Another Neural Machine Translation Toolkit☆177Updated 8 months ago
- Tracking the progress in end-to-end speech translation☆260Updated last year
- ☆177Updated 3 years ago
- A neural word aligner based on multilingual BERT☆338Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆150Updated last week
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆357Updated last year
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Automatic Mapping of Disfluency Annotations for corrected version of Switchboard☆17Updated 5 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆340Updated 2 years ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆225Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- Repository for SLURP paper☆98Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs☆148Updated 2 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆73Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆51Updated 4 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- Repository to collect and categorize Grammatical Error Correction papers.☆116Updated 4 months ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆53Updated 7 months ago
- Utilities for Processing the Switchboard Dialogue Act Corpus☆68Updated 4 years ago
- Implementation of meta-transfer-learning for ASR and LM (ACL 2020)☆50Updated 4 years ago
- cLang-8 is a dataset for grammatical error correction.☆103Updated 2 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆41Updated 3 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated last month
- ☆34Updated 2 years ago
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆307Updated last year
- A simple library for querying the URIEL typological database.☆87Updated 10 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆267Updated last month
- Universal Romanizer that can convert any unicode script to roman (latin) script☆179Updated 7 months ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago