korenyoni / opus-api
OPUS (opus.nlpl.eu) Python3 API
☆14Updated this week
Related projects: ⓘ
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated last year
- Utilities for manipulating finite state transducers with the OpenFst library.☆30Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆31Updated 2 years ago
- Language data store and linguistic query API☆35Updated 3 weeks ago
- Barista is an open-source framework for concurrent speech processing.☆36Updated 10 years ago
- Grapheme to phoneme converter for Estonian☆13Updated 3 years ago
- Expected edit distance implementation using OpenFst tools☆11Updated 9 years ago
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated last year
- universal syllabification algorithms☆43Updated last year
- Cross-Linguistic Transcription Systems☆14Updated 5 months ago
- R package for phonetic research and experimenting☆20Updated last month
- ADS Project☆14Updated 8 years ago
- A Python interface to OpenFst☆89Updated 5 years ago
- Basic dataset for the linguistic data collection.☆15Updated 7 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- The curation repository for the data behind Concepticon.☆32Updated this week
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data.☆51Updated 6 months ago
- Finite-state script normalization and processing utilities☆36Updated this week
- The Seshat audio annotation management platform☆13Updated 3 years ago
- Dataset used to analyze user preferences of podcast summaries☆8Updated 2 years ago
- wrassp is a wrapper for R around Michel Scheffers's libassp (Advanced Speech Signal Processor). The libassp library aims at providing fun…☆22Updated 8 months ago
- Read-only unofficial mirror of the OpenGrm NGram Library☆8Updated 5 years ago
- My public domain speech index☆10Updated 5 years ago
- Simple, standalone python classes for training statistical language models using several popular smoothing methods.☆25Updated 11 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 weeks ago
- Proposed splits for the LREC Wikipron paper☆13Updated 4 years ago
- Featurize words into orthographic and phonological vectors.☆39Updated last year
- Course in Natural Language Processing and Applications☆10Updated last year
- American English Pronunciation Dictionary☆33Updated 6 years ago
- Phonetic and phonological vocoding platform☆16Updated 7 years ago