LonelyRider-cs / LING4100_projectLinks
english to IPA translator using openNMT to create the models
☆15Updated 6 years ago
Alternatives and similar repositories for LING4100_project
Users that are interested in LING4100_project are comparing it to the libraries listed below
Sorting:
- English Resource Grammar☆24Updated 2 months ago
- British English pronunciation dictionary☆98Updated 8 years ago
- Spoken Cantonese from Hong Kong.☆30Updated last month
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese☆23Updated 3 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 3 years ago
- 粵文語料篩選器 Cantonese text filter☆41Updated 9 months ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆15Updated 3 years ago
- Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words☆99Updated 4 years ago
- Wikipedia Bilingual Reference Data (English)☆16Updated 9 years ago
- A modern, interlingual wordnet interface for Python☆277Updated this week
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆22Updated last month
- An English-to-Cantonese machine translation model☆53Updated 9 months ago
- Extract and align grammar patterns from English sentences.☆56Updated 3 years ago
- Automatically exported from code.google.com/p/m2m-aligner☆42Updated 9 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆66Updated last week
- cantonese-mandarin unsupervised neural translation for sw project☆28Updated 2 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http…☆25Updated 8 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆39Updated 5 years ago
- rime-cantonese 上游詞表倉庫☆30Updated 2 weeks ago
- CMU dictionary in IPA instead of their subset of Arpabet☆16Updated last year
- Python Finite-State Toolkit☆60Updated 2 weeks ago
- ✒️ LanguageTool integration for Quill.js editors☆17Updated last year
- fastText vectors created from Hong Kong data.☆22Updated 5 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 10 months ago
- Live Audio MFCC Visualization in the browser using Web Audio API - https://pulakk.github.io/Live-Audio-MFCC/tutorial☆41Updated 6 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last month
- The Cantonese Wordnet☆14Updated 2 years ago
- text-to-speech alignment java software☆20Updated 6 years ago