gentaiscool / code-switching-papers
A curated list of research papers and resources on code-switching
☆313Updated 4 months ago
Alternatives and similar repositories for code-switching-papers
Users that are interested in code-switching-papers are comparing it to the libraries listed below
Sorting:
- Yet Another Neural Machine Translation Toolkit☆178Updated 2 months ago
- ☆178Updated 3 years ago
- Tracking the progress in end-to-end speech translation☆260Updated last year
- A tool that locates, downloads, and extracts machine translation corpora☆154Updated 2 weeks ago
- A neural word aligner based on multilingual BERT☆348Updated 3 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆49Updated last year
- Automatic Mapping of Disfluency Annotations for corrected version of Switchboard☆17Updated 5 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆54Updated 9 months ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆75Updated last year
- Zero -- A neural machine translation system☆150Updated 2 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆360Updated last year
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆54Updated 4 years ago
- A Neural Machine Translation toolkit for research purpose☆82Updated 3 months ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- cLang-8 is a dataset for grammatical error correction.☆104Updated 2 years ago
- ☆34Updated 2 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆118Updated last month
- Tracking the progress in non-autoregressive generation (translation, transcription, etc.)☆307Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆277Updated 3 months ago
- Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus☆53Updated 5 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆75Updated 4 years ago
- This repository contains source code for the paper "Language Model Prior for Low-Resource Neural Machine Translation"☆42Updated 4 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆32Updated 3 years ago
- A repository containing the code for speech translation papers.☆21Updated 3 years ago
- A tool for holistic analysis of language generations systems☆468Updated 3 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆197Updated 9 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 4 years ago
- A simple library for querying the URIEL typological database.☆90Updated last year
- a tool for calcualting character n-gram F score☆72Updated 2 years ago