kyegomez / Audio-xLSTMsLinks
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆19Updated 3 weeks ago
Alternatives and similar repositories for Audio-xLSTMs
Users that are interested in Audio-xLSTMs are comparing it to the libraries listed below
Sorting:
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP]☆25Updated last year
- ☆31Updated 7 months ago
- Project for MIDI to Audio Synthesis☆25Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆35Updated 3 weeks ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆36Updated 4 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆36Updated 8 months ago
- Codebase and project page for EDMSound☆35Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 3 weeks ago
- ☆51Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆44Updated last week
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆15Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆22Updated 3 months ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
- ☆27Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆50Updated 8 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆33Updated 8 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆81Updated 3 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆33Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- ☆18Updated 6 months ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated 2 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆34Updated 2 weeks ago
- Fast and differentiable hidden Markov model in C++☆18Updated 2 years ago
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆39Updated 6 months ago
- ☆11Updated last year