IMLHF / SpecAugmentPyTorch
A PyTorch implementation (with batch and channel support) of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆13 Updated last year
Alternatives and similar repositories for SpecAugmentPyTorch
Users who are interested in SpecAugmentPyTorch compare it to the repositories listed below.
- A PyTorch implementation of the paper SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆33 Updated 4 years ago
- The PyTorch implementation of the paper Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆43 Updated 11 months ago
- Unofficial PyTorch implementation of Masked Autoencoders that Listen ☆70 Updated 3 years ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 3 years ago
- ☆31 Updated 2 years ago
- ☆66 Updated last year
- Public code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆88 Updated 3 years ago
- Audio-visual voice activity detection using deep learning ☆50 Updated 6 years ago
- Multi-task speech classification of the accent and gender of an English speaker on Mozilla's Common Voice dataset ☆27 Updated 5 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆28 Updated last year
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer ☆35 Updated 2 years ago
- Streaming Audio Transformers for Online Audio Tagging ☆49 Updated last year
- A PyTorch implementation of MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network ☆61 Updated 4 years ago
- Code for a controllable EVC framework for seen and unseen emotion generation. ☆44 Updated 4 years ago
- ☆29 Updated 3 years ago
- ☆19 Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆73 Updated 3 years ago
- Experiments about AudioSet ☆44 Updated 2 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV). ☆34 Updated 4 months ago
- ☆27 Updated 3 years ago
- PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆25 Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆39 Updated last year
- PyTorch implementation of the paper A Global-local Attention Framework for Weakly Labelled Audio Tagging. ☆13 Updated 4 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch ☆41 Updated 2 years ago
- ☆54 Updated 5 years ago
- ☆10 Updated 2 weeks ago
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training. ☆18 Updated 3 years ago
- This repo contains conv-tasnet for basis-melgan. For the basis-melgan code, please refer to FastVocoder. ☆21 Updated 4 years ago
- FastAudio is a learnable audio front-end designed by team Magnum for the ASVspoof 2021 challenge ☆45 Updated 2 years ago
- ☆30 Updated 4 years ago