ttop32 / wav2vec2-live-japanese-translator
Real-time Japanese speech recognition translator using wav2vec2
☆37Updated 2 years ago
Alternatives and similar repositories for wav2vec2-live-japanese-translator:
Users who are interested in wav2vec2-live-japanese-translator are comparing it to the libraries listed below:
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆68Updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated this week
- Putting flows on top of neural transducers for better TTS☆62Updated last month
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, with support for 44100 Hz Japanese audio sources.☆19Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆166Updated last year
- context labels and pronunciation data for JSUT corpus☆68Updated 3 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 2 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆51Updated 2 years ago
- Monotonic Alignment Search☆90Updated 2 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆36Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆249Updated last year
- Nue-ASR inference code by rinna Co., Ltd.☆32Updated 8 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- a Frontier Japanese Speech Generation net☆28Updated 3 weeks ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Updated 2 years ago
- A public domain single speaker Japanese speech dataset☆50Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆16Updated last year
- Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH)☆73Updated last year
- Demo for 2022 Interspeech☆29Updated 2 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- Fine-tuning Wav2Vec2.0 on Common Voice (zh-HK)☆14Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆82Updated 3 months ago