feldberlin / timething
Timething is a library for aligning text transcripts with their audio recordings.
☆119Updated 5 months ago
Alternatives and similar repositories for timething
Users that are interested in timething are comparing it to the libraries listed below
Sorting:
- ☆80Updated 11 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆162Updated 2 weeks ago
- Python forced alignment☆89Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆112Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆160Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 7 months ago
- Charsiu: A neural phonetic aligner.☆299Updated 2 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆148Updated 10 months ago
- Universal multilingual automatic speech transcription into IPA☆64Updated 2 months ago
- Segment an audio file and obtain utterance alignments. (Python package)☆335Updated last year
- ☆36Updated 10 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆91Updated last year
- A sequence-to-sequence voice conversion toolkit.☆97Updated 10 months ago
- Putting flows on top of neural transducers for better TTS☆62Updated last month
- Uses ctypes and libespeak-ng to transform test into IPA phonemes☆20Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆149Updated 11 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆136Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆88Updated last month
- ☆84Updated 7 months ago
- A small wrapper package around whisper-timestamped. Create force-aligned transcription TextGrids from raw audio!☆16Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆34Updated last year
- This is the M-AILABS Speech Dataset☆63Updated 5 months ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆225Updated 3 years ago
- A non-native English corpus for pronunciation scoring task☆132Updated 9 months ago
- A python package for deep multilingual punctuation prediction.☆123Updated 8 months ago
- Simple Diarization model☆47Updated last year
- ☆78Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated last month
- ☆38Updated 3 years ago