sangtaik / STTLinks

wav2vec를 사용한 STT 기능을 사용하여 음성인식 및 PPT 도우미 기능을 추가

☆9

Alternatives and similar repositories for STT

Users that are interested in STT are comparing it to the libraries listed below

Sorting:

ORI-Muchim / MB-iSTFT-VITS-Korean
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean C…
☆16Updated last year
jwj7140 / Bert-VITS2-Korean
vits2 backbone with multilingual-bert(한국어 지원)
☆26Updated last year
nc-ai / speech
☆17Updated 2 months ago
kdrkdrkdr / VALL-E-Korean
VALL-E 한국어 버전
☆12Updated last year
seongmin-mun / KoG2Padvanced
☆19Updated 2 years ago
kdrkdrkdr / JK-VITS
Bilingual-TTS (Japanese and Korean)
☆31Updated 2 years ago
stannam / hangul_to_ipa
A dash app that transcribes 한글 into [hɑŋɡɯl].
☆34Updated 2 weeks ago
etri / kmsav
☆11Updated 9 months ago
kdrkdrkdr / RVC-VITS
Few-shot multilingual tts with RVC and Vits
☆51Updated 2 years ago
tenebo / g2pk2
Updated folk of g2pk
☆12Updated last year
homink / speech.ko
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43Updated 7 years ago
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45Updated 2 years ago
Coda-SVS / nnsvs-korean-support
Korean language support for NNSVS/ENUNU
☆28Updated last year
hwRG / End-to-End-TTS-Fine-Tune
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
☆28Updated 2 years ago
misakiudon / MB-iSTFT-VITS-multilingual
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…
☆67Updated 2 years ago
dobby-seo / kosr
Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
☆30Updated 4 years ago
kdrkdrkdr / JA2ML-VITS
Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)
☆3Updated last year
hwRG / FastSpeech2-Pytorch-Korean-Multi-Speaker
Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.
☆8Updated 3 years ago
SMART-TTS / SMART-Multi-Speaker-Style-TTS
Multi-speaker & Multi-style TTS
☆29Updated last year
ttop32 / coqui_tts_korea
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
☆60Updated 3 years ago
knlee-voice / AI.Tech
Trends, Tools, News timeline ...
☆19Updated 3 months ago
SoonbeomChoi / BEGANSing
Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN
☆68Updated 4 years ago
voithru / wav2vec2_finetune
Wav2Vec2 finetune and inference code for IITP AI Grand Challenge
☆36Updated 3 years ago
Nyan-SouthKorea / RealTime_zeroshot_TTS_ko
Use openvoice v2 module to do real time tts(text to speech) task for on-device robotics. Trying to inference the model on single board li…
☆16Updated 9 months ago
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆76Updated last year
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated 2 years ago
kaistmm / voxceleb-disentangler
[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…
☆14Updated last year
reppy4620 / x-vits
☆13Updated last week
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Updated 4 years ago
JoungheeKim / K-wav2vec
☆86Updated 2 years ago