slp-rl / salmon
The official code for the SALMon🍣 benchmark
☆34 · Updated this week
Related projects:
- ☆44 · Updated last week
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995 ☆17 · Updated 3 weeks ago
- Please visit https://thuhcsi.github.io/SnakeGAN/ ☆36 · Updated last year
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M… ☆17 · Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆40 · Updated 2 months ago
- A toolkit dedicated to speech evaluation. ☆18 · Updated last month
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge. ☆30 · Updated 7 months ago
- ☆41 · Updated 2 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆29 · Updated 8 months ago
- The implementation of the paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody" ☆27 · Updated 9 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆25 · Updated 4 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM ☆14 · Updated 6 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆10 · Updated 9 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆18 · Updated 3 months ago
- Official code of ElasticAST (Interspeech 2024 paper) ☆20 · Updated last month
- ☆26 · Updated 3 months ago
- ☆16 · Updated 8 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆52 · Updated 3 weeks ago
- ☆44 · Updated last year
- This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (… ☆27 · Updated 2 years ago
- ☆57 · Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor ☆17 · Updated last year
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆45 · Updated 2 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆23 · Updated 6 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆44 · Updated last month
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆21 · Updated 5 months ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning ☆47 · Updated 8 months ago
- For students who would like to apply for RA, PhD, postdoc in audio research. ☆22 · Updated 11 months ago
- ☆37 · Updated 3 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆22 · Updated 3 months ago