sangtaik / STT
Adds speech recognition and a PPT assistant feature using wav2vec-based STT
☆9Updated 3 years ago
Alternatives and similar repositories for STT
Users that are interested in STT are comparing it to the libraries listed below
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean C…☆16Updated last year
- vits2 backbone with multilingual-bert (Korean supported)☆26Updated last year
- ☆17Updated 2 months ago
- Korean version of VALL-E☆12Updated last year
- ☆19Updated 2 years ago
- Bilingual-TTS (Japanese and Korean)☆31Updated 2 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl].☆34Updated 2 weeks ago
- ☆11Updated 9 months ago
- Few-shot multilingual TTS with RVC and VITS☆51Updated 2 years ago
- Updated fork of g2pk☆12Updated last year
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Updated 7 years ago
- A simple tool for using Montreal Forced Aligner easily. Also provides alignments (TextGrid) retrieved from ESD.☆45Updated 2 years ago
- Korean language support for NNSVS/ENUNU☆28Updated last year
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆28Updated 2 years ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…☆67Updated 2 years ago
- Transformer-based Korean speech recognition☆30Updated 4 years ago
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)☆3Updated last year
- Multi-speaker FastSpeech2 applicable to Korean, with detailed descriptions of training and synthesis.☆8Updated 3 years ago
- Multi-speaker & Multi-style TTS☆29Updated last year
- Korean TTS using coqui TTS (glowtts and multiband melgan)☆60Updated 3 years ago
- Trends, Tools, News timeline ...☆19Updated 3 months ago
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN☆68Updated 4 years ago
- Wav2Vec2 finetune and inference code for IITP AI Grand Challenge☆36Updated 3 years ago
- Uses the OpenVoice v2 module for real-time TTS (text-to-speech) in on-device robotics. Trying to inference the model on single board li…☆16Updated 9 months ago
- Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)☆76Updated last year
- A package for crawling audio from YouTube☆42Updated 2 years ago
- [INTERSPEECH 2024] Official PyTorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆14Updated last year
- ☆13Updated last week
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 4 years ago
- ☆86Updated 2 years ago