Daniel-Heo / realtrans
Real-time voice recognition & translation
☆14 Updated this week
Related projects
Alternatives and complementary repositories for realtrans
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean C… ☆13 Updated 10 months ago
- Adds speech recognition and a PPT assistant feature using wav2vec-based STT ☆9 Updated 2 years ago
- vits2 backbone with multilingual-bert (Korean support) ☆25 Updated 7 months ago
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference) ☆37 Updated 8 months ago
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset) ☆3 Updated 11 months ago
- Few-shot multilingual tts with RVC and Vits ☆49 Updated last year
- Bilingual-TTS (Japanese and Korean) ☆28 Updated last year
- Korean TTS using coqui TTS (glowtts and multiband melgan) ☆53 Updated 2 years ago
- 'Grad-TTS' with Multilingual Cleaners ☆10 Updated 7 months ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. ☆27 Updated last year
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai ☆32 Updated 7 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆10 Updated 5 years ago
- Easy tool that splits given audio based on speaker. ☆11 Updated 10 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin… ☆63 Updated last year
- Korean Text To Speech Project: Using Tacotron1, Tacotron2, Wavenet and Melgan ☆31 Updated 6 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ☆25 Updated 2 years ago
- Korean language support for NNSVS/ENUNU ☆27 Updated 7 months ago
- A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD. ☆43 Updated last year
- Korean Streaming ASR (with Denoiser and Conformer CTC) ☆19 Updated 6 months ago
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023] ☆8 Updated 4 months ago
- Diffusion Model for Voice Conversion ☆15 Updated 2 years ago
- Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH) ☆75 Updated 8 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆12 Updated 6 months ago
- Korean TTS, Tacotron2, Wavenet ☆164 Updated 4 years ago
- Stable Diffusion Studio ☆15 Updated 2 months ago