yukara-ikemiya / Swin-Transformer-1d
PyTorch implementation of Swin Transformer for 1-dimensional data
☆12 Updated last year
Alternatives and similar repositories for Swin-Transformer-1d
Users who are interested in Swin-Transformer-1d are comparing it to the repositories listed below.
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆36 Updated last year
- ☆10 Updated 3 years ago
- AudioLDM training, finetuning, evaluation and inference. ☆13 Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆34 Updated last month
- [Neurips'24 Spotlight] Official code for "Acoustic Volume Rendering for Neural Impulse Response Fields" ☆37 Updated 5 months ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆45 Updated 3 weeks ago
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆24 Updated 9 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters ☆30 Updated 2 years ago
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection ☆21 Updated 7 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 Updated 3 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆26 Updated last year
- Accompanying code for our paper "Point Cloud Audio Processing" ☆19 Updated 3 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac… ☆31 Updated last year
- An 1D optimal transport inspired loss function in the spectral domain. Can be used for improving frequency localization/estimation in dif… ☆21 Updated 2 weeks ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆41 Updated last year
- Audio-Visual Room Impulse Response Estimation ☆17 Updated 11 months ago
- ☆29 Updated last year
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?" ☆31 Updated 4 months ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆70 Updated 10 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv… ☆45 Updated 3 months ago
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection ☆20 Updated 10 months ago
- A repo that builds text to music datasets from scratch ☆22 Updated last month
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models ☆18 Updated 11 months ago
- A toolkit for researchers in the multimodal sound separation. ☆16 Updated last year
- Separate Anything in Audio with Zero Training ☆34 Updated 3 weeks ago
- Chorale Music Separation Dataset and Model Framework ☆35 Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch. ☆63 Updated last week
- ISMIR 24 Supplementary Material ☆14 Updated 8 months ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations" ☆38 Updated last year
- Official code of ElasticAST (Interspeech 2024 paper) ☆32 Updated 10 months ago