bluesignum / Audio-SentenceSplitLinks
With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the data from Google Speech Recognition API
☆10Updated 6 years ago
Alternatives and similar repositories for Audio-SentenceSplit
Users that are interested in Audio-SentenceSplit are comparing it to the libraries listed below
Sorting:
- ☆27Updated 4 years ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated 2 years ago
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Updated 4 years ago
- ☆14Updated 2 years ago
- ☆46Updated 6 months ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Updated 2 years ago
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 3 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Updated last year
- Workflow for forced alignment between languages☆23Updated 3 weeks ago
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Updated 7 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Updated 5 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Updated last week
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆36Updated last year
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆48Updated last year
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆14Updated 6 years ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Updated 4 years ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆42Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆23Updated 2 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 5 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆28Updated 3 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆20Updated 2 years ago
- Diffusion Model for Voice Conversion☆17Updated 3 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Updated last year
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50Updated 4 years ago
- A pakage for crawling audio from Youtube☆42Updated 2 years ago