facebookresearch / flashy
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpointing, logging, distributed, compatibility with Dora, and more!
☆107Updated 11 months ago
Alternatives and similar repositories for flashy:
Users that are interested in flashy are comparing it to the libraries listed below
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆86Updated last year
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆83Updated 4 months ago
- Implementation of DiffWave and SaShiMi audio generation models☆121Updated last year
- A collection of useful audio datasets and transforms for PyTorch.☆137Updated 2 years ago
- A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.☆66Updated 2 years ago
- A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.☆85Updated last year
- Audiogen Codec☆131Updated 7 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 4 months ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆194Updated last year
- ☆83Updated last year
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆80Updated 6 months ago
- PyTorch Dataset for Speech and Music audio☆73Updated 7 months ago
- ☆31Updated 2 years ago
- Song Describer is a data collection platform for annotating music with textual descriptions.☆57Updated 3 months ago
- ☆66Updated this week
- The demo page of UniAudio☆33Updated last year
- ☆64Updated 6 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆94Updated 7 months ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆111Updated 2 years ago
- ☆84Updated 11 months ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- PyTorch wrappers for using your model in audacity!☆173Updated last year
- Trainer for audio-diffusion-pytorch☆128Updated 2 years ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experimen…☆280Updated last year
- A repository for benchmarking neural vocoders by their quality and speed.☆208Updated this week
- A collection of audio autoencoders, in PyTorch.☆40Updated last year
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆63Updated 9 months ago
- ☆82Updated last year