isi-nlp / carmel

finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests

☆14

Alternatives and similar repositories for carmel

Users who are interested in carmel are comparing it to the repositories listed below

besacier / mboshi-french-parallel-corpus
☆22 Updated 3 years ago
coryshain / dnnseg
☆10 Updated 4 years ago
dogancan / expected-edit-distance
Expected edit distance implementation using OpenFst tools
☆11 Updated 10 years ago
shtoshni / g2p
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15 Updated 6 years ago
CUNY-CL / wikipron-modeling
Proposed splits for the LREC Wikipron paper
☆14 Updated 5 years ago
jacquelineCelia / lexicon_discovery
Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al., 2015 TACL
☆10 Updated 8 years ago
MLSpeech / DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12 Updated 8 years ago
se4u / neural_wfst
Code for the paper 'Weighting Finite State Transductions with Neural Context', Pushpendre Rastogi, Ryan Cotterell, Jason Eisner
☆28 Updated 6 years ago
alvations / usaarhat-repo
Hack and Tell @ Saarland University
☆19 Updated 7 years ago
alpoktem / punkProse
Punctuation generation for speech transcripts using lexical and prosodic features
☆41 Updated 6 years ago
kpu / preprocess
Corpus preprocessing
☆98 Updated last year
claravania / subword-lstm-lm
LSTM Language Model with Subword Units Input Representations
☆42 Updated 4 years ago
revdotcom / words2num
Convert words to numbers
☆20 Updated 3 years ago
rnd2110 / MorphAGram
A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars
☆17 Updated last year
nsmartinez / WERpp
Calculates the Word Error Rate between two text files
☆20 Updated 2 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36 Updated 3 months ago
mzboito / IWSLT2022_Tamasheq_data
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18 Updated 2 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11 Updated 10 years ago
jniehues-kit / SLT.KIT
Spoken Language Translation System
☆13 Updated 6 years ago
xinjli / phonepiece
phone inventory library
☆16 Updated 2 years ago
danijel3 / ClarinStudioKaldi
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18 Updated 3 years ago
bpopeters / mg2p
Multilingual grapheme-to-phoneme conversion
☆20 Updated 7 years ago
desilinguist / swig-srilm
SWIG Wrapper for the SRILM toolkit
☆34 Updated 4 years ago
fgnt / LatticeWordSegmentation
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
☆17 Updated 8 years ago
njsmith / pysrilm
An extremely simple Python wrapper for the SRI Language Modeling toolkit
☆70 Updated 10 years ago
shtoshni / speech_parsing
Code for NAACL 2018 paper "Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information"
☆12 Updated 8 years ago
CUNY-CL / citylex
An English lexical database from the Big 🍎, let's go Mets baby love da Mets
☆17 Updated 3 months ago
M4t1ss / SoftAlignments
Neural machine translation soft alignment visualisations for web and command line
☆72 Updated 3 years ago
CoEDL / kaldi_helpers
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15 Updated 5 years ago
ehsanasgari / 1000Langs
Creating super-parallel corpora of more than 1500 unique languages for NLP research
☆34 Updated 2 years ago