kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆12 · Updated this week
Related projects:
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆21 · Updated 4 months ago
- Tracks the latest audio AI agents, covering speech, music, sound effects, etc. ☆11 · Updated 9 months ago
- Codebase and project page for EDMSound ☆29 · Updated 9 months ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24] ☆17 · Updated 4 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 · Updated 2 years ago
- The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer … ☆17 · Updated 4 months ago
- Audio Entailment: Deductive Reasoning for Audio Understanding ☆10 · Updated last month
- Project for MIDI to Audio Synthesis ☆19 · Updated last year
- GPT for FACodec ☆13 · Updated 5 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆15 · Updated last month
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆22 · Updated this week
- Companion repository that facilitates the creation of Gradio endpoints accessible from within Digital Audio Workstations (DAWs… ☆22 · Updated last week
- Generate an accompaniment part with chords using an evolutionary algorithm. ☆8 · Updated 2 years ago
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti… ☆31 · Updated 9 months ago
- Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression ☆14 · Updated 10 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆41 · Updated last week
- Official source code of airsep ☆33 · Updated 5 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support" ☆18 · Updated last month
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing ☆16 · Updated 3 weeks ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one ☆27 · Updated last month
- Viterbi decoding in PyTorch ☆23 · Updated 3 weeks ago
- A fast Python library for aligning similar audio snippets passed in as NumPy arrays ☆40 · Updated last month
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations". ☆20 · Updated 2 years ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model" ☆14 · Updated 2 years ago