Fraunhofer-IIS / ODAQ
☆35 · Updated last month
Related projects
Alternatives and complementary repositories for ODAQ
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986 ☆37 · Updated last month
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source… ☆28 · Updated 3 months ago
- Landing Page for All Things Source Separation ☆17 · Updated 2 weeks ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆88 · Updated 4 months ago
- Differentiable dynamic range controller in PyTorch. ☆45 · Updated 2 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆42 · Updated 8 months ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆72 · Updated 2 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆42 · Updated 2 months ago
- TS-BSmamba2: A TWO-STAGE BAND-SPLIT MAMBA-2 NETWORK FOR MUSIC SEPARATION ☆37 · Updated 2 months ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… ☆26 · Updated 6 months ago
- Banquet: A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems ☆37 · Updated this week
- Implementation of FiNS model for RIR estimation ☆25 · Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆38 · Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆67 · Updated last week
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆31 · Updated 10 months ago
- Repo for source code of EBEN: Extreme Bandwidth Extension Network ☆69 · Updated 2 months ago
- Code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper" ☆20 · Updated 7 months ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks ☆34 · Updated last year
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆32 · Updated 2 months ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆33 · Updated 2 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support" ☆21 · Updated 3 months ago