facebookresearch / flashyLinks
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
☆117Updated last year
Alternatives and similar repositories for flashy
Users that are interested in flashy are comparing it to the libraries listed below
Sorting:
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆89Updated last year
- Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experimen…☆308Updated 2 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆94Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Implementation of DiffWave and SaShiMi audio generation models☆127Updated 2 years ago
- ☆86Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆69Updated 3 years ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Updated 2 years ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆47Updated last week
- A collection of useful audio datasets and transforms for PyTorch.☆143Updated 2 years ago
- ☆67Updated last year
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆89Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- The demo page of UniAudio☆34Updated last year
- ☆32Updated 3 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- ☆61Updated 2 years ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆111Updated 2 years ago
- ☆62Updated last year
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Updated last year
- Audiogen Codec☆144Updated last year
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆72Updated 6 months ago
- PyTorch wrappers for using your model in audacity!☆180Updated 2 years ago
- Contrastive Language-Audio Pretraining☆15Updated 4 years ago
- Official implementation of "Contrastive Audio-Language Learning for Music" (ISMIR 2022)☆122Updated last year
- ☆15Updated 3 years ago
- ☆23Updated 2 years ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆165Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆115Updated 2 years ago