halsay / ASR-TTS-paper-daily
Update ASR paper everyday
☆31Updated this week
Related projects ⓘ
Alternatives and complementary repositories for ASR-TTS-paper-daily
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆44Updated last month
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆38Updated 3 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆44Updated last week
- ConMamba for Automatic Speech Recognition☆44Updated 3 months ago
- ☆22Updated 4 months ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆19Updated last month
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆37Updated last month
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆94Updated last year
- ☆64Updated last year
- ☆48Updated last year
- The open source code for SimpleSpeech series☆108Updated last month
- ☆17Updated 3 months ago
- Clustering-based methods for overlapping diarization☆68Updated 10 months ago
- Streaming Audiotransformers for online Audio tagging☆41Updated 4 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 10 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆10Updated 10 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- ☆27Updated 7 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆81Updated this week
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆58Updated 7 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆27Updated 3 months ago
- ☆48Updated 2 weeks ago
- Official repository of NeXt-TDNN for speaker verification☆56Updated last month
- The official implementation of EmoSphere-TTS☆81Updated 3 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆41Updated last week
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆34Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆17Updated last month
- ☆20Updated 9 months ago
- ☆57Updated 2 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago