SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
β96Sep 5, 2020Updated 5 years ago
Alternatives and similar repositories for SpecAugment
Users that are interested in SpecAugment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¦A curated collection of handy code snippets, shell functions, and developer tipsβ20Jun 1, 2026Updated last week
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brainβ654Apr 5, 2022Updated 4 years ago
- β18Apr 12, 2021Updated 5 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.β28Apr 1, 2026Updated 2 months ago
- Learnable Gammatone Filterbank (LGTFB) and Equal-loudness Normalization (EN)β13Apr 24, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A python implementation of a traditional Dynamic Range Compressorβ14Oct 30, 2020Updated 5 years ago
- β17Aug 9, 2024Updated last year
- Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"β12Mar 11, 2019Updated 7 years ago
- β18Nov 15, 2021Updated 4 years ago
- β13Jul 14, 2024Updated last year
- fast SpecAugmentation code with numpy and scipyβ31Jul 5, 2019Updated 6 years ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Modelsβ23Jul 10, 2024Updated last year
- β15Jun 26, 2025Updated 11 months ago
- creating audio preprocessing features in TensorFlow keras layers,β14Jul 13, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3β47Apr 5, 2023Updated 3 years ago
- A TensorFlow-based spoken language identificationβ99Mar 22, 2023Updated 3 years ago
- A list of current Audio-Vision Multimodal with awesome resources (paper, application, data, review, survey, etc.).β32Oct 11, 2023Updated 2 years ago
- π A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).β19Apr 1, 2021Updated 5 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)β12Aug 9, 2018Updated 7 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapesβ77Oct 8, 2025Updated 8 months ago
- tf 2.0 implementation of Listen, attend and spellβ21Jan 19, 2021Updated 5 years ago
- Chainer implementation of between-class learning for sound recognition https://arxiv.org/abs/1711.10282β95Mar 27, 2018Updated 8 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.β96May 25, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".β425Aug 14, 2022Updated 3 years ago
- (Interspeech 2025, official code) Speech enhancement based on cascaded two flowsβ16Sep 1, 2025Updated 9 months ago
- Keyword spotting for audio with attention (KWS model for audio)β18Jul 15, 2021Updated 4 years ago
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory Soβ¦β18Dec 5, 2024Updated last year
- Domain Graph core libraryβ17Jan 14, 2023Updated 3 years ago
- Tools for speech processing, keyword spottingβ16Mar 11, 2020Updated 6 years ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Iβ¦β19Dec 5, 2024Updated last year
- β12Jun 22, 2020Updated 5 years ago
- Code for the paper "A Data-Driven Methodology for Considering Feasibility and Pairwise Likelihood in Deep Learning Based Guitar Tablatureβ¦β19Dec 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Improved Speech Enhancement GANsβ13Jun 24, 2020Updated 5 years ago
- Self-Supervised Contrastive Learning of Music Spectrogramsβ31May 10, 2021Updated 5 years ago
- R package with functions to calculate indices for soundscape ecology and other ecology research that uses audio recordings.β28Apr 6, 2019Updated 7 years ago
- Keras implementation of musicnn, a set of pre-trained deep convolutional neural networks for music audio taggingβ27May 17, 2021Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).β32Jun 27, 2019Updated 6 years ago
- An integrated framework for DWI Image QC and processingβ13Mar 9, 2026Updated 3 months ago
- Group review spammer detectionβ10Sep 9, 2019Updated 6 years ago