kyegomez / SimplifiedTransformers
SimplifiedTransformer simplifies the transformer block without affecting training: skip connections, projection parameters, sequential sub-blocks, and normalization layers are removed. Experimental results confirm similar training speed and performance.
☆14 Updated 3 weeks ago
Alternatives and similar repositories for SimplifiedTransformers
Users who are interested in SimplifiedTransformers are comparing it to the libraries listed below.
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’ ☆18 Updated 3 months ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech ☆18 Updated 9 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR. ☆26 Updated 2 years ago
- ☆26 Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆20 Updated last year
- an implementation of FAdam (Fisher Adam) in PyTorch ☆49 Updated 4 months ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru… ☆27 Updated 2 weeks ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence) ☆13 Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated last year
- My implementation of diffusion (like) models ☆11 Updated 2 years ago
- Phonemes and durations labeling based on whisper small ☆11 Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135 ☆19 Updated 2 years ago
- ☆40 Updated 4 months ago
- speaker-disentangled speech linguistic content quantizer ☆23 Updated 8 months ago
- Implementation of a Light Recurrent Unit in Pytorch ☆49 Updated last year
- text to speech ☆10 Updated last year
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆20 Updated 2 years ago
- A Neural Audio Codec (NAC) for Universal Audio ☆43 Updated 5 months ago
- The source code for the paper CrossSinger (asru2023) ☆18 Updated 2 years ago
- GPT for FACodec ☆13 Updated last year
- Parallel waveform generation with DiffusionGAN ☆17 Updated 3 years ago
- 4G GPU & 10 Minutes for train ☆12 Updated 2 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation ☆12 Updated 11 months ago
- ESLTTS dataset ☆16 Updated 9 months ago
- Code for INTERSPEECH 2023 paper "mdctGAN: Taming transformer-based GAN for speech super-resolution with Modified DCT spectra" ☆62 Updated 2 years ago
- ☆11 Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation ☆16 Updated last year
- ☆19 Updated 2 years ago
- ☆16 Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆26 Updated last year