bluesignum / Audio-SentenceSplitLinks
With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the data from Google Speech Recognition API
☆10Updated 6 years ago
Alternatives and similar repositories for Audio-SentenceSplit
Users that are interested in Audio-SentenceSplit are comparing it to the libraries listed below
Sorting:
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Updated 7 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆14Updated 6 years ago
- ☆27Updated 4 years ago
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 3 years ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Updated 2 years ago
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆20Updated last year
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Updated 2 years ago
- ☆14Updated 2 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆22Updated last month
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆65Updated 4 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Updated last year
- A pakage for crawling audio from Youtube☆42Updated 2 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆14Updated last year
- Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS☆64Updated 3 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157Updated 2 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆38Updated 2 months ago
- ☆25Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Updated last year
- Transcription and diarization (speaker identification)☆34Updated 2 years ago
- Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)☆31Updated 4 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆43Updated last year
- ☆45Updated 5 months ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆50Updated 4 years ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Updated 3 years ago
- Gaze estimation from 2D image☆12Updated last year
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Updated 4 years ago