deepvk / museLinks
π΅ muse: Music Separation
β10Updated last year
Alternatives and similar repositories for muse
Users that are interested in muse are comparing it to the libraries listed below
Sorting:
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 5 months ago
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β17Updated 5 months ago
- β20Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β20Updated 4 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)β12Updated last year
- Whisper Speech Quality Assessment (WhiSQA)β16Updated 2 weeks ago
- Forced alignment decoder for Whisper.β14Updated last year
- β23Updated this week
- Official repository of Wavehax vocoderβ55Updated 3 months ago
- β44Updated 3 months ago
- β48Updated 4 months ago
- β17Updated 9 months ago
- β‘ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.β33Updated last year
- StyleTTS2 + Vocos as a Decoderβ13Updated 7 months ago
- Viterbi decoding in PyTorchβ37Updated last month
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.β10Updated 7 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)β12Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fastβ¦β43Updated this week
- Data manipulation and transformation for audio signal processing, powered by PyTorchβ10Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ16Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessmentβ37Updated 5 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechβ10Updated 5 months ago
- Sequence alignement methods with helpers for PyTorch.β24Updated 2 years ago
- A toolkit dedicate for speech evaluation.β24Updated last year
- Implementation of vocoders empowered with pytorch lightningβ18Updated last year
- β13Updated 7 months ago
- β13Updated 2 years ago
- A simple command line tool to calculate WER for ASR.β14Updated last year
- β25Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Modelβ13Updated 6 months ago