yukara-ikemiya / Swin-Transformer-1d
PyTorch implementation of Swin Transformer for 1-dimensional data
☆17Updated last year
Alternatives and similar repositories for Swin-Transformer-1d
Users who are interested in Swin-Transformer-1d are comparing it to the repositories listed below
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆45Updated 7 months ago
- ☆85Updated 2 years ago
- Chorale Music Separation Dataset and Model Framework☆40Updated 3 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Updated last year
- Da - ECHO - RetrievAl - daTasEt☆34Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference.☆14Updated last year
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a…☆26Updated last month
- ☆13Updated 4 months ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆67Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆51Updated 9 months ago
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Updated 2 years ago
- ☆39Updated last year
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆33Updated 10 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆69Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆56Updated 3 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆61Updated last year
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection☆22Updated last year
- Sound field estimation based on physics-constrained neural kernel☆21Updated 7 months ago
- Code and datasets for 'Few-Shot Audio-Visual Learning of Environment Acoustics' (NeurIPS 2022)☆23Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34Updated last year
- Implementation of FiNS model for RIR estimation☆37Updated 2 years ago
- ☆32Updated 2 weeks ago
- Prediction of sound event bounding boxes (SEBBs)☆31Updated last year
- ☆66Updated 2 years ago
- OpenFLAM: Frame-Wise Language-Audio Model☆26Updated 2 weeks ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Updated 2 years ago
- This repository is the official implementation of the paper "Frequency-Temporal Attention Network for Singing Melody Extraction".☆40Updated 3 years ago