kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆17Updated this week
Alternatives and similar repositories for Audio-xLSTMs:
Users that are interested in Audio-xLSTMs are comparing it to the libraries listed below
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆23Updated 9 months ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated last month
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆12Updated 4 months ago
- Project for MIDI to Audio Synthesis☆22Updated last year
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆19Updated 5 months ago
- ☆16Updated 4 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆31Updated 2 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆43Updated 5 months ago
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated 2 months ago
- GPT for FACodec☆13Updated 10 months ago
- ☆14Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆11Updated 5 months ago
- Digital Speech Processing in PyTorch.☆14Updated 2 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆20Updated 3 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Landing Page for All Things Source Separation☆19Updated 2 months ago
- ☆21Updated 9 months ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆14Updated 5 months ago
- Official implementation of Self-Remixing☆13Updated 11 months ago
- ☆10Updated 2 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆11Updated 7 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆30Updated last year
- iSeparate library for the SDX2023 challenge☆13Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆11Updated last year
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆33Updated 3 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆33Updated 4 months ago
- Viterbi decoding in PyTorch☆27Updated 3 months ago
- Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial …☆14Updated 2 weeks ago