giovana-morais / stemeLinks
[ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation
☆13Updated 2 years ago
Alternatives and similar repositories for steme
Users that are interested in steme are comparing it to the libraries listed below
Sorting:
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆23Updated 4 years ago
- Project for MIDI to Audio Synthesis☆25Updated 2 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Updated 3 years ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Updated last month
- music semantic understanding evaluation benchmark☆25Updated 2 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 3 years ago
- ☆20Updated 8 months ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- A piano music dataset with Audio, Symbolic and Text labels☆33Updated 8 months ago
- ☆32Updated 3 years ago
- ☆20Updated last year
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 3 years ago
- ☆19Updated 4 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated last year
- ☆16Updated 7 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Updated 3 years ago
- Deep Performer: Score-to-audio music performance synthesis☆44Updated 2 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆20Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Updated this week
- Sing any popular song with your voice☆11Updated 3 years ago
- The ArtificialSongGenerator automatically composes and compiles the Artifical Audio Multitrack dataset (AAM).☆24Updated this week
- Chorale Music Separation Dataset and Model Framework☆40Updated 2 years ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆42Updated 11 months ago
- ☆27Updated last year
- ☆10Updated 2 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- ☆15Updated 4 years ago
- Using Word embeddings for automatic EQ mixing☆13Updated 3 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago