deepvk / museLinks
π΅ muse: Music Separation
β11Updated last year
Alternatives and similar repositories for muse
Users that are interested in muse are comparing it to the libraries listed below
Sorting:
- β29Updated 2 weeks ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)β12Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.β15Updated 4 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"β20Updated 3 months ago
- β25Updated last year
- β43Updated 2 months ago
- Forced alignment decoder for Whisper.β14Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speechβ10Updated 4 months ago
- Whisper Speech Quality Assessment (WhiSQA)β15Updated 9 months ago
- ποΈ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets β¨β17Updated 3 months ago
- Unofficial implementation of wavenext vocoderβ49Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ15Updated last year
- Official repository of Wavehax vocoderβ53Updated last month
- source code of EfficientTTS 2β18Updated last year
- A simple command line tool to calculate WER for ASR.β14Updated 11 months ago
- Sequence alignement methods with helpers for PyTorch.β24Updated 2 years ago
- β17Updated last year
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representationsβ19Updated 4 months ago
- C++ version of pyannote audio overlapped speech detection pipelineβ13Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Modelβ12Updated 5 months ago
- Viterbi decoding in PyTorchβ37Updated last week
- StyleTTS2 + Vocos as a Decoderβ13Updated 5 months ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesisβ¦β14Updated 6 months ago
- β48Updated 2 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11β¦β46Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fastβ¦β37Updated last week
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORMβ18Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)β12Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"β24Updated last week
- β17Updated 8 months ago