JusperLee / Arxiv-New-Paper-ServerLinks
Arxiv automatically obtains the latest article service.
☆11Updated 5 years ago
Alternatives and similar repositories for Arxiv-New-Paper-Server
Users that are interested in Arxiv-New-Paper-Server are comparing it to the libraries listed below
Sorting:
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆14Updated 2 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Updated 4 years ago
- ICASSP2022 TTS&VC Summary☆14Updated 3 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆75Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Updated 3 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 4 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 5 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14Updated 5 years ago
- ☆44Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Updated 5 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- A Neural Audio Codec (NAC) for Universal Audio☆41Updated 3 months ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Updated 5 years ago
- ☆52Updated 4 years ago
- ☆64Updated 3 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated 2 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Updated 2 years ago
- ☆14Updated 3 years ago
- Mutiband version of HIFIGAN☆18Updated 4 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Speech samples and code of BEdit-TTS☆34Updated last year
- ☆15Updated 4 years ago
- video cut powered by AI☆25Updated 2 years ago
- ☆25Updated 3 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆25Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Updated 5 years ago