proger / haloop
Agent toolkit for 100 hours of speech and 10 GiB of text
☆13 · Updated 9 months ago
Related projects
Alternatives and complementary repositories for haloop
- g2p ID: Indonesian Grapheme-to-Phoneme Converter ☆13 · Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated 9 months ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini ☆19 · Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆26 · Updated last year
- phone inventory library ☆15 · Updated last year
- This is the M-AILABS Speech Dataset ☆22 · Updated 4 months ago
- A collection of utilities for handling IPA phones. ☆25 · Updated last year
- Open Source Crimean Tatar Text-to-Speech datasets ☆13 · Updated last year
- ☆56 · Updated last year
- ☆17 · Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆39 · Updated 3 months ago
- Use quantized versions of Whisper to speed up inference ☆11 · Updated last month
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆61 · Updated 8 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆21 · Updated 2 months ago
- ☆22 · Updated 3 years ago
- ☆19 · Updated 5 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch ☆16 · Updated 2 years ago
- Pronunciation-assisted Subword Modeling ☆29 · Updated 5 years ago
- The VoxTube dataset official repository ☆61 · Updated 9 months ago
- ☆32 · Updated 2 months ago
- Prosodic Speech Segmentation with Transformers ☆23 · Updated 8 months ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio ☆20 · Updated last month
- [NOT-in-Progress] PyTorch implementation of "Pre-Alignment Guided Attention for Improving Training Efficiency and Model Stability in End-… ☆9 · Updated last year
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-… ☆21 · Updated 2 months ago
- ☆20 · Updated 6 years ago
- Evaluation of STT models for german language ☆15 · Updated 2 years ago
- ☆8 · Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆35 · Updated last month
- Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitt… ☆16 · Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆12 · Updated 3 years ago