facebookresearch / flashy
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
☆103Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for flashy
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆82Updated last month
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated last month
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆62Updated 2 years ago
- Scalable and Performant Data Loading☆66Updated this week
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆83Updated last year
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆86Updated last year
- Implementation of DiffWave and SaShiMi audio generation models☆118Updated last year
- Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experimen…☆272Updated last year
- A collection of useful audio datasets and transforms for PyTorch.☆133Updated last year
- ☆15Updated 2 years ago
- Speech in Flax/JAX☆15Updated 2 years ago
- ☆62Updated 3 months ago
- ☆84Updated 7 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆109Updated last year
- ☆53Updated 3 weeks ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆71Updated 2 months ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆57Updated 5 months ago
- The demo page of UniAudio☆34Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- Audiogen Codec☆127Updated 4 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated last year
- PyTorch wrappers for using your model in audacity!☆173Updated last year
- My explorations into editing the knowledge and memories of an attention network☆34Updated last year
- PyTorch Dataset for Speech and Music audio☆73Updated 4 months ago
- ☆22Updated last month
- ☆31Updated 2 years ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆142Updated 10 months ago
- A repository for benchmarking neural vocoders by their quality and speed.☆203Updated last month
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 3 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆88Updated 3 months ago