lessw2020 / FAdam_PyTorch
an implementation of FAdam (Fisher Adam) in PyTorch
☆43Updated 10 months ago
Alternatives and similar repositories for FAdam_PyTorch:
Users who are interested in FAdam_PyTorch are comparing it to the libraries listed below:
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 6 months ago
- small audio language model for reasoning☆55Updated 3 weeks ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆38Updated last week
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆112Updated 2 years ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆67Updated 7 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 8 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆52Updated 5 months ago
- Implementation of Google's USM speech model in Pytorch☆30Updated last week
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆16Updated 8 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆64Updated 3 weeks ago
- ☆11Updated 8 months ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 2 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Updated 4 months ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆61Updated 8 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated 7 months ago
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆84Updated 2 months ago
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆21Updated 2 weeks ago
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated 9 months ago
- Implementation of the proposed Adam-atan2 from Google DeepMind in Pytorch☆103Updated 4 months ago
- Code repository for FreGrad☆50Updated 10 months ago
- Official PyTorch implementation of "Paralinguistics-Aware Speech-Empowered LLMs for Natural Conversation" (NeurIPS 2024)☆87Updated 4 months ago
- ☆35Updated last year
- ☆24Updated last week
- Official release of StyleTalk dataset.☆62Updated 9 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆81Updated last month
- An official implementation of Style-Talker for Spoken Dialogue Generation☆17Updated 3 months ago
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆11Updated 4 months ago
- Official repository of Wavehax vocoder☆46Updated 4 months ago
- ☆13Updated last year
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆14Updated 10 months ago