bluesignum / Audio-SentenceSplitLinks
With one whole audio and corresponding text, the audio can be split line by line and saved with exact sentence using comparison with the data from Google Speech Recognition API
☆10Updated 6 years ago
Alternatives and similar repositories for Audio-SentenceSplit
Users that are interested in Audio-SentenceSplit are comparing it to the libraries listed below
Sorting:
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆22Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆20Updated last year
- Diffusion Model for Voice Conversion☆17Updated 3 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆14Updated 6 years ago
- This is a winter of code project aimed at speech enhancement of text to speech models.☆24Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆35Updated last year
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆65Updated 4 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆13Updated last year
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Updated 3 weeks ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆33Updated last year
- ☆27Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Updated 2 years ago
- ☆14Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Updated 2 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆37Updated last month
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated 2 years ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 5 years ago
- ☆19Updated 8 months ago
- A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)☆13Updated 3 years ago
- Hangul pronunciation and romanisation based on Wiktionary ko-pron lua module☆21Updated 7 years ago
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Updated 4 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆22Updated last month
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30Updated 2 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆13Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Updated 5 years ago
- A pakage for crawling audio from Youtube☆42Updated 2 years ago