guxm2021 / SVT_SpeechBrain
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆24 Updated 10 months ago
Alternatives and similar repositories for SVT_SpeechBrain
Users who are interested in SVT_SpeechBrain are comparing it to the libraries listed below.
- Codebase for the ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆36 Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆28 Updated this week
- ☆18 Updated 2 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 Updated last year
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆23 Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆37 Updated last year
- Timbre Transfer using Denoising Diffusion Implicit Models (ISMIR 2023) ☆27 Updated 3 months ago
- Phonemes and durations labeling based on whisper small ☆11 Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆25 Updated last year
- ☆16 Updated 2 months ago
- Official implementation of Self-Remixing ☆14 Updated last year
- The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer … ☆19 Updated 6 months ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals ☆17 Updated 11 months ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆40 Updated 4 months ago
- ☆44 Updated 8 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆21 Updated 3 weeks ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER… ☆33 Updated 2 years ago
- ☆20 Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆18 Updated 2 weeks ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model ☆32 Updated last year
- GPT for FACodec ☆13 Updated last year
- Official source codes of airsep ☆36 Updated last year
- Code for the paper Musical Voice Separation as Link Prediction: Modeling a Musical Perception Task as a Multi-Trajectory Tracking Proble… ☆8 Updated last year
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models ☆13 Updated 11 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 Updated last year
- The source code for the paper CrossSinger (ASRU 2023) ☆18 Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion ☆49 Updated 2 years ago
- ☆18 Updated 3 years ago
- ☆10 Updated 3 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 Updated 3 months ago