hipudding / pytorch-lightning
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
☆9Updated last year
Alternatives and similar repositories for pytorch-lightning:
Users that are interested in pytorch-lightning are comparing it to the libraries listed below
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆61Updated 9 months ago
- Keras implement of Finite Scalar Quantization☆71Updated last year
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆36Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆27Updated last month
- ☆24Updated 8 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated last year
- A spoken version of the textual story cloze benchmark☆16Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆67Updated 8 months ago
- Shortcut flow matching Pytorch implementation☆32Updated 3 months ago
- ☆24Updated 2 years ago
- ☆22Updated 6 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- ☆28Updated last year
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆29Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆48Updated this week
- The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)☆72Updated 4 months ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆44Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph"☆54Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- MIO: A Foundation Model on Multimodal Tokens☆25Updated 4 months ago
- Source code for the paper 'Audio Captioning Transformer'☆54Updated 3 years ago
- ☆25Updated 2 years ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆127Updated 9 months ago
- Streaming Vocos☆24Updated 3 months ago
- ☆29Updated last week
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆49Updated 5 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Updated 8 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Updated 2 years ago
- ☆46Updated 3 months ago