open-dict-data / wikidict-enLinks
Wikipedia Bilingual Reference Data (English)
☆15Updated 9 years ago
Alternatives and similar repositories for wikidict-en
Users that are interested in wikidict-en are comparing it to the libraries listed below
Sorting:
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 11 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆16Updated last week
- English Resource Grammar☆22Updated last week
- Software for phonetic transcription of English and Finnish, and IPA tools☆15Updated 9 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆54Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 6 months ago
- The Unicode Cookbook for Linguists☆56Updated 4 years ago
- British English pronunciation dictionary☆96Updated 7 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆82Updated 3 months ago
- An even smaller speech recognizer / force aligner☆35Updated 8 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- The zhong [|] Chinese grammars☆15Updated 3 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 months ago
- CMU dictionary in IPA instead of their subset of Arpabet☆16Updated 11 months ago
- ☆22Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Updated 9 years ago
- Natural Language Inflection in English☆11Updated 3 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- A C++ library implementing fast language models estimation using the 1-Sort algorithm.☆17Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- ☆10Updated 4 years ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆17Updated this week
- Helsinki Finite-State Technology (library and application suite)☆133Updated 3 months ago
- English web corpus with 4M tokens and several annotation types☆26Updated 2 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated last year