🎵 muse: Music Separation
☆11 · Feb 14, 2024 · Updated 2 years ago
Alternatives and similar repositories for muse
Users interested in muse are comparing it to the libraries listed below.
- ☆45 · Jun 11, 2025 · Updated 9 months ago
- iSTFTNet: Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆14 · Aug 25, 2023 · Updated 2 years ago
- VITRina: VIsual Token Representations ☆11 · Jun 15, 2023 · Updated 2 years ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks. ☆35 · Jan 19, 2024 · Updated 2 years ago
- METR: Message Enhanced Tree-Ring ☆21 · Aug 19, 2024 · Updated last year
- ☆130 · Aug 19, 2024 · Updated last year
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆16 · Jun 16, 2024 · Updated last year
- Normalize Text in Russian ☆28 · Nov 7, 2023 · Updated 2 years ago
- ☆13 · Oct 11, 2024 · Updated last year
- Yet Another Config Library for C++ ☆10 · Sep 21, 2018 · Updated 7 years ago
- Target speaker automatic speech recognition (TS-ASR) ☆12 · Oct 14, 2023 · Updated 2 years ago
- ☆14 · Oct 3, 2025 · Updated 5 months ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis ☆44 · Jul 24, 2023 · Updated 2 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆28 · Sep 20, 2025 · Updated 6 months ago
- 3D game engine ☆15 · Nov 17, 2021 · Updated 4 years ago
- Improving Neural Text Generation with Reinforcement Learning ☆23 · Jan 13, 2021 · Updated 5 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆13 · Sep 27, 2024 · Updated last year
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models" ☆169 · Jan 16, 2025 · Updated last year
- Russian phonetical transcription ☆11 · Nov 19, 2025 · Updated 4 months ago
- Base for building Figma plugins with React ☆16 · Jul 20, 2022 · Updated 3 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 · Mar 12, 2024 · Updated 2 years ago
- Neural model for prediction of stress position in Russian words ☆13 · Jun 22, 2025 · Updated 9 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 · Jun 7, 2025 · Updated 9 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆32 · Mar 8, 2024 · Updated 2 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) - python package for placing stress in Russian text using RNN (BiLST… ☆45 · Aug 7, 2024 · Updated last year
- Simple fluid simulation right in your terminal ☆49 · Mar 14, 2026 · Updated last week
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆23 · Nov 23, 2018 · Updated 7 years ago
- Simple implementation of ECS in C++ ☆16 · May 29, 2018 · Updated 7 years ago
- ☆13 · Dec 7, 2022 · Updated 3 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based … ☆65 · Aug 24, 2025 · Updated 6 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆22 · Jan 10, 2025 · Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec" ☆35 · Oct 23, 2025 · Updated 4 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Simple Java interface to handle Bitcoin transactions ☆12 · May 7, 2013 · Updated 12 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024 ☆12 · Apr 15, 2025 · Updated 11 months ago
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 · Feb 14, 2024 · Updated 2 years ago
- ☆20 · Sep 2, 2024 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆30 · May 27, 2023 · Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification ☆24 · Sep 22, 2024 · Updated last year