lessw2020 / FAdam_PyTorch
an implementation of FAdam (Fisher Adam) in PyTorch
☆48Updated last month
Alternatives and similar repositories for FAdam_PyTorch
Users who are interested in FAdam_PyTorch are comparing it to the libraries listed below.
- Implementation of the proposed Adam-atan2 from Google Deepmind in Pytorch☆112Updated 8 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆90Updated 5 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆48Updated 10 months ago
- Implementation of Agent Attention in Pytorch☆91Updated last year
- Attempt to make multiple residual streams from Bytedance's Hyper-Connections paper accessible to the public☆88Updated last month
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆115Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆19Updated last year
- Implementation of the proposed MaskBit from Bytedance AI☆82Updated 8 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆52Updated 6 months ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆127Updated last year
- The Gaussian Histogram Loss (HL-Gauss) proposed by Imani et al. with a few convenient wrappers for regression, in Pytorch☆65Updated 2 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆74Updated 3 weeks ago
- Implementation of Google's USM speech model in Pytorch☆31Updated 2 weeks ago
- small audio language model for reasoning☆71Updated 3 months ago
- My attempts at applying Soundstream design on learned tokenization of text and then applying hierarchical attention to text generation☆88Updated 9 months ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆118Updated 2 months ago
- Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".☆63Updated last year
- ☆48Updated 11 months ago
- Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"☆148Updated 8 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆77Updated last month
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆100Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆13Updated 2 weeks ago
- Explore how to get VQ-VAE models efficiently!☆39Updated 2 weeks ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆41Updated this week
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆17Updated 5 months ago
- Official release of StyleTalk dataset.☆67Updated last year
- Official PyTorch implementation for "MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens…☆32Updated last month
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts).☆81Updated this week