lessw2020 / FAdam_PyTorchLinks
an implementation of FAdam (Fisher Adam) in PyTorch
☆50Updated 7 months ago
Alternatives and similar repositories for FAdam_PyTorch
Users that are interested in FAdam_PyTorch are comparing it to the libraries listed below
Sorting:
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆96Updated 11 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆135Updated 3 months ago
- ☆53Updated last year
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆129Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated last year
- Implementation of Agent Attention in Pytorch☆93Updated last year
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆24Updated 4 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated last week
- ☆26Updated last year
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆124Updated 3 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- small audio language model for reasoning☆86Updated 2 months ago
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆168Updated 2 weeks ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Updated 11 months ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆167Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated last year
- ☆48Updated 9 months ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Updated last year
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Updated 11 months ago
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- Implementation of Google's USM speech model in Pytorch☆34Updated 2 weeks ago
- unofficial Split Mean Flow Implementation from bytedance☆66Updated 5 months ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆132Updated 4 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆90Updated last year