google-research / seanetLinks
☆128Updated last year
Alternatives and similar repositories for seanet
Users that are interested in seanet are comparing it to the libraries listed below
Sorting:
- ☆43Updated last year
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆366Updated 2 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆211Updated 5 months ago
- A collection of useful audio datasets and transforms for PyTorch.☆141Updated 2 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆81Updated 4 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆209Updated 3 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated 2 years ago
- The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"☆361Updated 2 years ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆256Updated 2 weeks ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆86Updated 3 years ago
- open-source audio datasets☆154Updated 2 years ago
- ☆91Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- A large synthetic dataset of spatial audio with multiple labels☆117Updated 2 years ago
- ☆67Updated 5 months ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Pitch Estimating Neural Networks (PENN)☆268Updated 7 months ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆121Updated 11 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆120Updated 2 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 4 years ago
- Benchmark popular audio i/o packages☆152Updated last year
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆138Updated last year
- Fast and high quality sample-rate conversion library for Python☆103Updated last month
- Audiogen Codec☆143Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆186Updated last year
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆162Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆152Updated 3 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆60Updated 11 months ago