facebookresearch / flashy
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
☆110Updated last year
Alternatives and similar repositories for flashy:
Users that are interested in flashy are comparing it to the libraries listed below
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆87Updated 2 years ago
- A collection of useful audio datasets and transforms for PyTorch.☆139Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆68Updated 2 years ago
- Implementation of DiffWave and SaShiMi audio generation models☆122Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆86Updated 6 months ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆85Updated last year
- ☆64Updated 8 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 6 months ago
- Audiogen Codec☆135Updated 9 months ago
- Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experimen…☆282Updated last year
- PyTorch wrappers for using your model in audacity!☆174Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- ☆15Updated 2 years ago
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆65Updated 11 months ago
- ☆66Updated 3 weeks ago
- ☆84Updated last year
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆95Updated 9 months ago
- ☆84Updated last year
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆120Updated 8 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆113Updated 2 years ago
- A GPU accelerated and torch based audio DSP library☆70Updated this week
- ☆31Updated 2 years ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Updated last year
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Updated this week
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆103Updated 4 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight …☆235Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆58Updated last week
- ☆59Updated last year
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Updated last year