backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆10Updated last year
Alternatives and similar repositories for distilXLSR:
Users who are interested in distilXLSR are comparing it to the libraries listed below.
- A toolkit dedicated to speech evaluation.☆19Updated 3 months ago
- Official implementation and dataset of the paper DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆12Updated last month
- ☆27Updated 6 months ago
- ☆10Updated last month
- ☆23Updated this week
- The implementation of the paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆31Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 4 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated last year
- A neural speech codec based on discrete WavLM representations☆22Updated 4 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆28Updated 3 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 3 months ago
- ☆21Updated last year
- ☆54Updated 2 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆23Updated 3 months ago
- Streaming Vocos☆19Updated last week
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆11Updated last month
- ☆53Updated 11 months ago
- ☆25Updated 8 months ago
- Just another FastSpeech 2 but cleaner code :)☆25Updated 6 months ago
- ☆19Updated last year
- Source code for DM-Codec.☆34Updated 3 months ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆59Updated 9 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆31Updated 2 months ago
- (WIP) Long-form speech generation☆29Updated last month
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆49Updated 2 months ago
- ☆15Updated 3 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆14Updated 4 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆23Updated 4 months ago
- ☆18Updated 8 months ago