bluesignum / Audio-SentenceSplitLinks
With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the data from Google Speech Recognition API
☆10Updated 6 years ago
Alternatives and similar repositories for Audio-SentenceSplit
Users that are interested in Audio-SentenceSplit are comparing it to the libraries listed below
Sorting:
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆21Updated 2 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆35Updated last month
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆13Updated 5 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
- Workflow for forced alignment between languages☆20Updated last year
- ☆45Updated last month
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Updated 6 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆20Updated last year
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 3 months ago
- Breaks a word into syllables using an LSTM-based neural network.☆20Updated 2 years ago
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 2 years ago
- Simple synthetic audio feature extractor☆35Updated 8 months ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆12Updated last year
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆28Updated 2 years ago
- ☆14Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆17Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Updated 5 years ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆13Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆30Updated last year
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA)☆13Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆46Updated 4 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆29Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Reference☆34Updated last year
- Diffusion Model for Voice Conversion☆17Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago