🎵 muse: Music Separation
☆11Feb 14, 2024Updated 2 years ago
Alternatives and similar repositories for muse
Users that are interested in muse are comparing it to the libraries listed below
Sorting:
- ☆44Jun 11, 2025Updated 8 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆14Aug 25, 2023Updated 2 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Jan 19, 2024Updated 2 years ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- 👀 VITRina: VIsual Token Representations☆11Jun 15, 2023Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Normalize Text in Russian☆28Nov 7, 2023Updated 2 years ago
- ☆13Oct 11, 2024Updated last year
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆27Sep 20, 2025Updated 5 months ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Sep 27, 2024Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- 🚜 METR: Message Enhanced Tree-Ring☆21Aug 19, 2024Updated last year
- ☆21Mar 7, 2025Updated 11 months ago
- ☆19Jan 8, 2025Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆43Aug 7, 2024Updated last year
- ☆130Aug 19, 2024Updated last year
- T5-based (russian) text normalization☆25Jan 25, 2024Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 4 months ago
- ☆20Sep 2, 2024Updated last year
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆24Jan 9, 2024Updated 2 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- Данные 6-го издания «Грамматического словаря русского язы ка» А. А. Зализняка (2010) в виде текстовых файлов☆25Sep 17, 2024Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆24Dec 20, 2022Updated 3 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆32Mar 8, 2024Updated last year
- ☆28Nov 15, 2023Updated 2 years ago
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆64Aug 24, 2025Updated 6 months ago