hipudding / pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
☆10Updated last year
Alternatives and similar repositories for pytorch-lightning
Users who are interested in pytorch-lightning are comparing it to the libraries listed below.
- Keras implementation of Finite Scalar Quantization☆73Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆40Updated last year
- ☆25Updated 9 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆27Updated 3 months ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆61Updated 10 months ago
- ☆23Updated 8 months ago
- A Pytorch Implementation of Finite Scalar Quantization☆136Updated last year
- Paper, Code and Statistics for Speech Generation.☆10Updated 2 years ago
- Shortcut flow matching Pytorch implementation☆48Updated 5 months ago
- A Neural Audio Codec (NAC) for Universal Audio☆28Updated last week
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆69Updated 9 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆59Updated 7 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- TensorFlow implementation of "Finite Scalar Quantization: VQ-VAE Made Simple" (ICLR 2024)☆18Updated last year
- ESLTTS dataset☆16Updated 4 months ago
- ☆28Updated last year
- [INTERSPEECH 2025] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆30Updated this week
- SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆264Updated 5 months ago
- ☆13Updated last year
- Streaming Vocos☆26Updated 4 months ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated 2 years ago
- ☆25Updated 10 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆58Updated 7 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)☆76Updated 5 months ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆44Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Updated 2 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆41Updated this week
- A spoken version of the textual story cloze benchmark☆17Updated last year
- The demo page for ALMTokenizer☆48Updated last month