bluesignum / Audio-SentenceSplit
With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the data from Google Speech Recognition API
☆10Updated 6 years ago
Alternatives and similar repositories for Audio-SentenceSplit:
Users that are interested in Audio-SentenceSplit are comparing it to the libraries listed below
- A pakage for crawling audio from Youtube☆41Updated last year
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Updated 5 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated last year
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆88Updated 3 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆27Updated last year
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆29Updated 2 months ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated 10 months ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆44Updated last year
- Repository for "Training Audio Captioning Models without Audio"☆9Updated last year
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Updated 7 years ago
- Emotional Speech Conversion using Nonparallel Data☆16Updated 6 years ago
- ☆42Updated 3 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157Updated last year
- Diffusion Model for Voice Conversion☆17Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆20Updated 11 months ago
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- ☆24Updated 7 months ago
- 로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼☆106Updated 2 months ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆39Updated 3 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- ☆55Updated last year
- Korean ASR Corpus generated from TEDx talks☆27Updated 6 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆84Updated 2 years ago
- ☆66Updated 4 months ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year