wenet-e2e / wecut
video cut powered by AI
☆25Updated 2 years ago
Alternatives and similar repositories for wecut:
Users that are interested in wecut are comparing it to the libraries listed below
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 3 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- Decoders from Kaldi using OpenFst☆28Updated 3 months ago
- ☆20Updated 6 months ago
- ICASSP2022 TTS&VC Summary☆14Updated 2 years ago
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆51Updated 9 months ago
- ☆13Updated 3 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 3 years ago
- RepVgg + HiFiGAN☆34Updated 2 years ago
- Based on https://github.com/fatchord/WaveRNN☆24Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 4 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- ☆26Updated 4 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- ☆43Updated 4 years ago
- Mutiband version of HIFIGAN☆18Updated 4 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 8 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 7 months ago
- Simulation of parallel synthesis with LPCNet vocoder☆14Updated 4 years ago
- ☆36Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- ☆56Updated 2 years ago
- ☆20Updated 5 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 months ago
- ☆25Updated 3 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year