earthspecies / beans
BEANS: The Benchmark of Animal Sounds
☆71Updated last year
Related projects: ⓘ
- AVES: Animal Vocalization Encoder based on Self-Supervision☆76Updated 2 months ago
- ☆17Updated 2 months ago
- ☆29Updated this week
- ☆78Updated last year
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆27Updated 2 years ago
- ☆71Updated last year
- ☆36Updated 3 months ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆20Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆41Updated last week
- Code for the "NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks" paper.☆34Updated 2 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆32Updated last week
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆58Updated last year
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)☆76Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆81Updated last month
- PyTorch wrappers for using your model in audacity!☆172Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆46Updated 2 years ago
- Deep Performer: Score-to-audio music performance synthesis☆41Updated last year
- ☆62Updated 3 weeks ago
- PyTorch Dataset for Speech and Music audio☆73Updated 2 months ago
- An open-source package providing standardized tools for sound event analysis and data management.☆16Updated 3 weeks ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆25Updated 4 months ago
- Project for MIDI to Audio Synthesis☆19Updated last year
- SDX23 startkit for the Demucs baselines.☆23Updated last year
- Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.☆35Updated last month
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆36Updated 10 months ago
- Realtime (streaming) DDSP in PyTorch compatible with neutone☆44Updated last year
- ☆10Updated last year
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆52Updated last year
- Comparison of Python audio resampling implementations☆43Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆101Updated last year