deepvk / museLinks
π΅ muse: Music Separation
β11Updated last year
Alternatives and similar repositories for muse
Users that are interested in muse are comparing it to the libraries listed below
Sorting:
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 4 months ago
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β17Updated 4 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β20Updated 4 months ago
- β43Updated 2 months ago
- Forced alignment decoder for Whisper.β14Updated last year
- Whisper Speech Quality Assessment (WhiSQA)β15Updated 10 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Modelβ13Updated 6 months ago
- β18Updated last year
- β13Updated 6 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechβ10Updated 4 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)β12Updated last year
- β25Updated last year
- β17Updated 9 months ago
- A simple command line tool to calculate WER for ASR.β14Updated 11 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)β12Updated last year
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.orβ¦β16Updated 2 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open β¦β21Updated 3 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrievalβ13Updated 3 months ago
- Official repository of Wavehax vocoderβ54Updated 2 months ago
- Collection of scripts from mHuBERT-147.β30Updated 10 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ16Updated last year
- Official code of SenSE.β18Updated last week
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representationsβ20Updated 2 weeks ago
- Unofficial implementation of wavenext vocoderβ50Updated last year
- β12Updated this week
- β48Updated 3 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filteringβ21Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"β27Updated 3 weeks ago
- C++ version of pyannote audio overlapped speech detection pipelineβ13Updated last year
- Viterbi decoding in PyTorchβ37Updated 3 weeks ago