pyyush/SpecAugment

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

☆96

Alternatives and similar repositories for SpecAugment

Users who are interested in SpecAugment are comparing it to the repositories listed below.

VSydorskyy / BirdCLEF_2025_2nd_place
☆25 · Oct 25, 2025 · Updated 8 months ago
kunimi00 / ContrastiveSSLMusicAudio
☆13 · Jun 2, 2022 · Updated 4 years ago
haideraltahan / CLAR
☆18 · Apr 12, 2021 · Updated 5 years ago
hspark84 / lgtfb-en
Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)
☆13 · Apr 24, 2020 · Updated 6 years ago
Xia-aaa / L3former
☆14 · Jun 26, 2025 · Updated last year
contactless-healthcare / Deep-Learning-for-Lung-Sound-Analysis
☆14 · Jan 10, 2024 · Updated 2 years ago
djmoffat / pyCompressor
A python implementation of a traditional Dynamic Range Compressor
☆14 · Oct 30, 2020 · Updated 5 years ago
alan-turing-institute / Turing-RSS-Health-Data-Lab-Biomedical-Acoustic-Markers
☆17 · Aug 9, 2024 · Updated last year
yingtaoHuo / wakeUp
Reproduction of the paper "Small-footprint keyword spotting using deep neural networks"
☆12 · Mar 11, 2019 · Updated 7 years ago
aeesha-T / parkinsons_prediction_using_speech
☆18 · Nov 15, 2021 · Updated 4 years ago
KimJeongSun / SpecAugment_numpy_scipy
fast SpecAugmentation code with numpy and scipy
☆31 · Jul 5, 2019 · Updated 7 years ago
Sreyan88 / CompA
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23 · Jul 10, 2024 · Updated 2 years ago
qiuqiangkong / mini_music_tagging
☆13 · Jul 14, 2024 · Updated 2 years ago
infected4098 / Wave-U-Mamba
An official documentation of the paper <Wave-U-Mamba: An End-To-End Framework For High-Quality And Efficient Speech Super Resolution>.
☆26 · Oct 29, 2025 · Updated 8 months ago
danielkrause / DCASE2022-data-generator
Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3
☆47 · Apr 5, 2023 · Updated 3 years ago
SpeechFlow-io / Spoken_language_identification
A TensorFlow-based spoken language identification
☆100 · Mar 22, 2023 · Updated 3 years ago
kongkip / spela
creating audio preprocessing features in TensorFlow keras layers
☆14 · Jul 13, 2021 · Updated 5 years ago
zcaceres / spec_augment
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501 · Jun 11, 2021 · Updated 5 years ago
ServerSideHannes / las
tf 2.0 implementation of Listen, attend and spell
☆21 · Jan 19, 2021 · Updated 5 years ago
jx1100370217 / ASR_dosmono
Automatic Speech Recognition with TensorFlow (CNN+BLSTM+CTC)
☆12 · Aug 9, 2018 · Updated 7 years ago
kuielab / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
☆20 · Apr 1, 2021 · Updated 5 years ago
mdangschat / speech-corpus-dl
Download and preparation tool for free speech corpora.
☆16 · Apr 28, 2019 · Updated 7 years ago
mil-tokyo / bc_learning_sound
Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282
☆95 · Mar 27, 2018 · Updated 8 years ago
YuanGongND / ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
☆427 · Aug 14, 2022 · Updated 3 years ago
lim3944 / ReMixmatch-pytorch
☆14 · Oct 14, 2020 · Updated 5 years ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆96 · May 25, 2023 · Updated 3 years ago
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022
☆15 · Jun 18, 2022 · Updated 4 years ago
zhuowangsylu / ColluEagle
Group review spammer detection
☆10 · Sep 9, 2019 · Updated 6 years ago
OptimusPrimus / tacos
Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining
☆16 · Oct 12, 2025 · Updated 9 months ago
wdqqdw / Echo
Project page of "2026-ICLR Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning"
☆16 · Mar 26, 2026 · Updated 3 months ago
Kirili4ik / kws-attention-pytorch
Keyword spotting for audio with attention (KWS model for audio)
☆18 · Jul 15, 2021 · Updated 5 years ago
lysanderism / TimeAudio
The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…
☆30 · Nov 18, 2025 · Updated 8 months ago
linto-ai / pyrtstools
Tools for speech processing, keyword spotting
☆16 · Mar 11, 2020 · Updated 6 years ago
domain-graph / domain-graph
Domain Graph core library
☆17 · Jan 14, 2023 · Updated 3 years ago
kaen2891 / stethoscope-guided_supervised_contrastive_learning
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory So…
☆18 · Dec 5, 2024 · Updated last year
daisukelab / image-anomaly-det
☆12 · Jun 22, 2020 · Updated 6 years ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 · May 17, 2024 · Updated 2 years ago
mguner / audio_search
Use speech_to_text for keyword search in audio files.
☆12 · May 5, 2021 · Updated 5 years ago
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19 · Dec 5, 2024 · Updated last year