Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch
☆135 · Oct 15, 2025 · Updated 5 months ago
Alternatives and similar repositories for adam-atan2-pytorch
Users that are interested in adam-atan2-pytorch are comparing it to the libraries listed below.
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 · Aug 5, 2024 · Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention ☆100 · Aug 18, 2024 · Updated last year
- Sequence alignment methods with helpers for PyTorch. ☆24 · Nov 30, 2022 · Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention ☆220 · Feb 13, 2023 · Updated 3 years ago
- ☆28 · Nov 15, 2023 · Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆18 · Nov 1, 2023 · Updated 2 years ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated ☆34 · Aug 14, 2024 · Updated last year
- Utilities for PyTorch distributed ☆25 · Feb 27, 2025 · Updated last year
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch. ☆133 · Sep 25, 2023 · Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Oct 22, 2023 · Updated 2 years ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation ☆90 · Oct 11, 2024 · Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX ☆186 · Aug 3, 2025 · Updated 7 months ago
- Implementation of the algorithm detailed in paper "Evolutionary design of molecules based on deep learning and a genetic algorithm" ☆24 · Dec 15, 2023 · Updated 2 years ago
- Unofficial implementation of GotenNet, new SOTA 3d equivariant transformer, in Pytorch ☆67 · Apr 7, 2025 · Updated 11 months ago
- ☆21 · Jul 15, 2024 · Updated last year
- A pytorch implementation of D3Net. ☆11 · Aug 8, 2021 · Updated 4 years ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model ☆213 · Apr 26, 2024 · Updated last year
- ☆13 · Sep 12, 2024 · Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆207 · Aug 26, 2023 · Updated 2 years ago
- Implementation of Flash Attention in Jax ☆227 · Mar 1, 2024 · Updated 2 years ago
- Unofficial vits2-TTS implementation in PyTorch, adapted to support 44100 Hz Japanese audio sources. ☆24 · Sep 1, 2023 · Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆93 · Jul 4, 2024 · Updated last year
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · May 31, 2022 · Updated 3 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability. ☆113 · Jun 4, 2025 · Updated 9 months ago
- SA-toolkit: Speaker speech anonymization toolkit in python ☆30 · Sep 18, 2025 · Updated 6 months ago
- Official implementation of the TTS model Lina-Speech ☆179 · Jan 9, 2025 · Updated last year
- NanoGPT (124M) in 5 minutes ☆15 · Feb 14, 2025 · Updated last year
- ☆30 · Oct 3, 2022 · Updated 3 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated 2 years ago
- A robust pitch tracker using synchro-squeezed FFT and frequency domain autocorrelation ☆36 · Jan 17, 2024 · Updated 2 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers ☆433 · May 17, 2023 · Updated 2 years ago
- Helpful tools and examples for working with flex-attention ☆1,161 · Feb 8, 2026 · Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆28 · Apr 23, 2024 · Updated last year
- Transformers components but in Triton ☆34 · May 9, 2025 · Updated 10 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 · Mar 11, 2024 · Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support ☆27 · Oct 5, 2024 · Updated last year
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate ☆755 · Nov 19, 2024 · Updated last year
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning) ☆29 · Jul 9, 2024 · Updated last year
- ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS) ☆10 · Mar 9, 2024 · Updated 2 years ago