bootphon / pygamma-agreement
Gamma Agreement in Python
☆43Updated 10 months ago
Alternatives and similar repositories for pygamma-agreement:
Users that are interested in pygamma-agreement are comparing it to the libraries listed below
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 4 years ago
- ☆22Updated 2 years ago
- Morfessor EM+Prune☆10Updated 4 years ago
- ☆10Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆72Updated last year
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆11Updated 4 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 3 months ago
- Multilingual Open Text☆25Updated 2 months ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- SegEval Segmentation Evaluation Package☆55Updated last year
- Utilities for Processing the Switchboard Dialogue Act Corpus☆67Updated 3 years ago
- Tool to fix bitexts and tag near-duplicates for removal☆29Updated 5 months ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 4 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated last month
- A simple neural truecaser written in pytorch and allennlp.☆32Updated 7 months ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆16Updated 2 years ago
- ☆24Updated 5 years ago
- Convert words to numbers☆20Updated 2 years ago
- ☆17Updated last year
- ☆17Updated 5 months ago
- ☆48Updated 2 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆11Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆113Updated 5 years ago
- phone inventory library☆16Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆76Updated 4 months ago
- ☆14Updated 5 years ago