microsoft / NeuralSpeech
☆1,405Updated 11 months ago
Alternatives and similar repositories for NeuralSpeech:
Users that are interested in NeuralSpeech are comparing it to the libraries listed below
- Unified-Modal Speech-Text Pre-Training for Spoken Language Processing☆1,262Updated 9 months ago
- Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch☆1,304Updated last year
- YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone☆935Updated 2 months ago
- Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch☆2,488Updated 2 weeks ago
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)☆471Updated 11 months ago
- ☆987Updated last week
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆1,914Updated last year
- A Generative Flow for Text-to-Speech via Monotonic Alignment Search☆675Updated 2 years ago
- Large, modern dataset for speech recognition☆657Updated 11 months ago
- PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html☆2,084Updated last year
- List of speech synthesis papers.☆1,017Updated last year
- A 10000+ hours dataset for Chinese speech recognition☆515Updated last year
- Official PyTorch implementation of BigVGAN (ICLR 2023)☆940Updated 4 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion☆618Updated last week
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,588Updated 9 months ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.☆570Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…☆1,719Updated last year
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆464Updated 2 weeks ago
- chinese speech pretrained models☆1,065Updated 5 months ago
- Library for Textless Spoken Language Processing☆531Updated last year
- unofficial vits2-TTS implementation in pytorch☆505Updated 10 months ago
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,261Updated 6 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆653Updated last month
- Tools for handling speech data in machine learning projects.☆975Updated last month
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.☆574Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆520Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,023Updated 6 months ago
- Chinese text normalization for speech processing☆646Updated last year
- Contrastive Language-Audio Pretraining☆1,511Updated 2 months ago
- The dataset of Speech Recognition☆400Updated last month