kylebgorman / EditTransducerLinks

Python implementation of Levenshtein distance and Levenshtein automata matching

☆27

Alternatives and similar repositories for EditTransducer

Users that are interested in EditTransducer are comparing it to the libraries listed below

Sorting:

rwsproat / text-normalization-data
Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.
☆76Updated 4 years ago
awni / py-arpa-lm
Python API for reading and querying ARPA formatted language models.
☆33Updated 10 years ago
jniehues-kit / SLT.KIT
Spoken Language Translation System
☆13Updated 6 years ago
kpu / preprocess
Corpus preprocessing
☆98Updated last year
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 3 months ago
vsiivola / variKN
A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…
☆40Updated 11 months ago
se4u / neural_wfst
Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner
☆28Updated 6 years ago
matthewfl / openfst-wrapper
☆28Updated 4 years ago
sonos / spoken-language-understanding-research-datasets
☆49Updated 3 years ago
revdotcom / words2num
Convert words to numbers
☆20Updated 3 years ago
shauryr / google_text_normalization
RNNs for Text Normalization
☆39Updated 7 years ago
dowobeha / ldc_downloader
Script to download corpora from the Linguistic Data Consortium (LDC)
☆32Updated last year
kmario23 / KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
☆114Updated 6 years ago
sfischer13 / python-arpa
Python library for n-gram models in ARPA format
☆40Updated 2 years ago
mattiadg / FBK-Fairseq-ST
An adaptation of Fairseq to (End-to-end) speech translation.
☆22Updated 3 years ago
jpuigcerver / openfst-python
Self-contained Python package for OpenFst
☆51Updated 2 years ago
belambert / edit-distance
Python library for computing edit distance between arbitrary Python sequences.
☆102Updated 5 months ago
besacier / mboshi-french-parallel-corpus
☆22Updated 3 years ago
clp-research / deep_disfluency
Deep Learning systems for training and testing disfluency detection and related tasks on speech data.
☆59Updated 6 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated last year
irstlm-team / irstlm
☆77Updated 2 years ago
claravania / subword-lstm-lm
LSTM Language Model with Subword Units Input Representations
☆42Updated 4 years ago
shtoshni / speech_parsing
Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"
☆12Updated 8 years ago
qiujiali / lattice_rnn
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
☆16Updated 4 years ago
bootphon / pygamma-agreement
Gamma Agreement in Python
☆44Updated last year
arenjansen / ZRTools
Zero-Resource Speech Discovery, Search, and Evaluation Tools
☆29Updated 10 years ago
M4t1ss / SoftAlignments
Neural macine translation soft alignment visualisations for web and command line
☆72Updated 3 years ago
cognibit / Text-Normalization-Demo
Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain
☆60Updated 6 years ago
alicank / Translation-Augmented-LibriSpeech-Corpus
Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…
☆44Updated 3 years ago
antho-rousseau / XenC
XenC: open-source data selection tool for NLP
☆64Updated 9 years ago