peterolson / aws-chinese-forced-alignment
Tool for aligning Chinese transcripts with audio using the AWS transcribe service
☆14Updated 2 years ago
Alternatives and similar repositories for aws-chinese-forced-alignment:
Users that are interested in aws-chinese-forced-alignment are comparing it to the libraries listed below
- 粵文語料篩選器 Cantonese text filter☆38Updated last month
- Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.☆111Updated last year
- CLDR text segmentation for JavaScript☆38Updated 10 months ago
- The CMU Pronouncing Dictionary converted to IPA☆80Updated 5 years ago
- A tool for automatic phoneme transcription☆157Updated last year
- Tokenizes Chinese texts into words.☆96Updated 2 years ago
- Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and c…☆43Updated 2 years ago
- Wikipedia Bilingual Reference Data (English)☆15Updated 8 years ago
- for splitting words into their component syllables☆52Updated 8 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆132Updated 11 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP☆87Updated 3 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆35Updated 4 years ago
- Chinese (zh-cnm) opendata audio files for 8,596 hsk words and 1,707 syllabs.☆45Updated 3 years ago
- A collection of modules and utilities for doing things with phonemes.☆50Updated 2 years ago
- A tool to find grammar patterns in Chinese text☆26Updated 5 years ago
- Hong Kong Cantonese Corpus of transcribed speech (spontaneous speech, radio programmes and a monologue).☆56Updated last year
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Updated 4 years ago
- Transformers for Cantonese☆56Updated 4 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆44Updated 4 years ago
- Identification and conversion functions for Chinese text processing☆59Updated 4 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆87Updated last year
- A script for audio/transcript alignment. Fork of p2fa.☆68Updated 7 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆33Updated last year
- British English pronunciation dictionary☆92Updated 7 years ago
- Unconjugate conjugated Japanese verbs.☆23Updated 10 months ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆241Updated 5 years ago
- Python interface for forced audio alignment using HTK and SoX☆335Updated 4 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆76Updated 3 years ago
- Javascript library for reading and writing textgrid files☆16Updated 2 years ago
- IPA Keyboard. International Phonetic Alphabet Symbols Web and Desktop Application built using Vue.js, Gulp and Node-Webkit☆21Updated 6 months ago