kyegomez / SimplifiedTransformersLinks
SimplifiedTransformer simplifies transformer block without affecting training. Skip connections, projection parameters, sequential sub-blocks, and normalization layers are removed. Experimental results confirm similar training speed and performance.
☆15Updated 3 weeks ago
Alternatives and similar repositories for SimplifiedTransformers
Users that are interested in SimplifiedTransformers are comparing it to the libraries listed below
Sorting:
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆20Updated 4 months ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆26Updated last month
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Updated 2 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Updated last year
- an implementation of FAdam (Fisher Adam) in PyTorch☆49Updated 5 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆20Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆19Updated 10 months ago
- ☆26Updated last year
- ☆11Updated last year
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022☆118Updated 3 years ago
- Phonemes and durations labeling based on whisper small☆11Updated last year
- ESLTTS dataset☆16Updated 10 months ago
- The source code for the paper CrossSinger (asru2023)☆18Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆24Updated 9 months ago
- PyTorch-based implementations of short-time Fourier transform☆15Updated 5 months ago
- My implementation of diffusion (like) models☆11Updated 2 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆13Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 10 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated last year
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- ☆19Updated last year
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆36Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆31Updated 2 years ago
- ☆41Updated 5 months ago
- ☆11Updated 2 years ago
- ☆13Updated 4 months ago