bbc / dsrp_bbcavs10k_distributionLinks
Repo for the BBCAVS10k distribution
☆9Updated 7 months ago
Alternatives and similar repositories for dsrp_bbcavs10k_distribution
Users that are interested in dsrp_bbcavs10k_distribution are comparing it to the libraries listed below
Sorting:
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆18Updated last year
- Transformer-based visually grounded speech models☆19Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Updated 2 years ago
- Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".☆55Updated 2 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆20Updated 2 years ago
- DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning☆49Updated last year
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆28Updated 2 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Updated 7 months ago
- ☆18Updated 4 years ago
- ☆41Updated 2 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆27Updated 3 years ago
- A minimum JukeMIR branch for feature extraction.☆32Updated 3 years ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 4 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- ☆36Updated 4 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆25Updated last month
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆37Updated last year
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆54Updated 2 weeks ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆13Updated 11 months ago
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆26Updated 4 months ago
- ☆14Updated 2 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆12Updated 8 months ago
- Project website for "Telling left from right: Learning spatial correspondence between sight and sound"☆23Updated 3 years ago
- Code for the paper Musical Voice Separation as Link Prediction: Modeling a Musical Perception Task as a Multi-Trajectory Tracking Proble…☆8Updated last year
- Event Relation in Text-to-Audio (TTA) Generation☆20Updated 4 months ago
- experiments about AudioSet☆44Updated last year
- ☆15Updated 4 years ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆42Updated 2 weeks ago
- Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in S…☆17Updated last year
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆24Updated 2 years ago