Hertin / WavPrompt
☆36Updated 2 years ago
Related projects
Alternatives and complementary repositories for WavPrompt
- ☆36Updated 3 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset☆23Updated 2 months ago
- ☆31Updated last year
- ☆53Updated 3 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- ☆15Updated 3 years ago
- Transformer-based visually grounded speech models☆19Updated 2 years ago
- "SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks": speech processing with the prompting paradigm☆81Updated last year
- multilingual speech aligner☆71Updated 11 months ago
- ☆12Updated last year
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆38Updated last week
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆32Updated last year
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- experiments about AudioSet☆43Updated last year
- This repo contains conv-tasnet for basis-melgan. For the basis-melgan code itself, please refer to FastVocoder.☆19Updated 3 years ago
- ☆48Updated this week
- WaveNet autoencoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- ☆17Updated 6 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆25Updated 11 months ago
- ☆22Updated 4 months ago
- ☆64Updated 2 years ago
- ☆20Updated 3 years ago
- ☆25Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago