google-research / seanet
☆126Updated 7 months ago
Alternatives and similar repositories for seanet
Users that are interested in seanet are comparing it to the libraries listed below
Sorting:
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆273Updated last month
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆96Updated 9 months ago
- Pitch Estimating Neural Networks (PENN)☆251Updated last month
- A collection of useful audio datasets and transforms for PyTorch.☆139Updated 2 years ago
- Utility tools for the "Divide and Remaster" dataset, introduced as part of the Cocktail Fork problem paper: https://arxiv.org/abs/2110.09…☆72Updated 2 years ago
- SDX23 startkit for the Demucs baselines.☆28Updated 2 years ago
- A large synthetic dataset of spatial audio with multiple labels☆106Updated last year
- ☆82Updated 2 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆78Updated 4 years ago
- Convmelspec: Convertible Melspectrograms via 1D Convolutions☆139Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆175Updated 9 months ago
- Official implementation of SawSing (ISMIR'22)☆262Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated last year
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆115Updated 2 years ago
- Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]☆33Updated 7 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated 2 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆54Updated 3 years ago
- PyTorch Dataset for Speech and Music audio☆75Updated 10 months ago
- A PyTorch implementation of the paper: "LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation" (ICAS…☆85Updated 2 years ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆85Updated 2 years ago
- ☆43Updated 11 months ago
- A DDSP-based neural voice synthesiser.☆116Updated 6 months ago
- Self-supervised learning for fast pitch estimation☆219Updated 2 months ago
- A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB …☆170Updated 11 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Who calls the shots? Rethinking Few-Shot Learning for Audio (WASPAA 2021)☆43Updated 2 years ago
- A complete training recipe for kaldi-based Automatic Lyrics Transcription.☆31Updated 3 years ago
- Audiogen Codec☆135Updated 10 months ago
- Pytorch implementation of BigVSAN☆204Updated last year