google-research / seanetLinks
☆130Updated last year
Alternatives and similar repositories for seanet
Users that are interested in seanet are comparing it to the libraries listed below
Sorting:
- A collection of useful audio datasets and transforms for PyTorch.☆144Updated 3 years ago
- This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.☆367Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆210Updated 3 years ago
- A repository for benchmarking neural vocoders by their quality and speed.☆212Updated 8 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- ☆94Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆100Updated last year
- Pitch Estimating Neural Networks (PENN)☆269Updated 10 months ago
- VoiceLDM: Text-to-Speech with Environmental Context☆192Updated last year
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆87Updated 3 years ago
- A large synthetic dataset of spatial audio with multiple labels☆123Updated 2 years ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆130Updated last year
- PyTorch Dataset for Speech and Music audio☆80Updated last year
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆73Updated 2 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆153Updated 3 years ago
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Updated last year
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆127Updated 4 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Updated 5 years ago
- A collection of pre-trained audio models, in PyTorch.☆115Updated 3 years ago
- Benchmark popular audio i/o packages☆151Updated 2 years ago
- Official implementation of SawSing (ISMIR'22)☆272Updated 3 years ago
- Convmelspec: Convertible Melspectrograms via 1D Convolutions☆147Updated last year
- ☆74Updated last year
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆281Updated last week
- This code is to run the WARP-Q speech quality metric.☆35Updated last year
- ☆44Updated last year
- Pytorch implementation of BigVSAN☆203Updated 2 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated 2 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Updated 2 years ago