KimJeongSun/SpecAugment_numpy_scipy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KimJeongSun/SpecAugment_numpy_scipy)

KimJeongSun / SpecAugment_numpy_scipy

fast SpecAugmentation code with numpy and scipy

☆31

Alternatives and similar repositories for SpecAugment_numpy_scipy

Users that are interested in SpecAugment_numpy_scipy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DemisEom / SpecAugment
View on GitHub
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆655Apr 5, 2022Updated 4 years ago
andi611 / Conditional-SpecGAN-Tensorflow
View on GitHub
Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network
☆10Dec 12, 2018Updated 7 years ago
Kyubyong / specAugment
View on GitHub
Tensor2tensor experiment with SpecAugment
☆46May 13, 2019Updated 7 years ago
mdangschat / speech-corpus-dl
View on GitHub
Download and preperation tool for free speech corpora.
☆16Apr 28, 2019Updated 7 years ago
aidiary / urban-sound-classification-keras
View on GitHub
☆14Oct 2, 2017Updated 8 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zcaceres / spec_augment
View on GitHub
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501Jun 11, 2021Updated 5 years ago
AmirmohammadRostami / KeywordsSpotting-EfficientNet-A0
View on GitHub
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting
☆23Jun 16, 2022Updated 4 years ago
idiap / CNN_QbE_STD
View on GitHub
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32Sep 3, 2018Updated 7 years ago
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
foundintranslation / Kaldi
View on GitHub
Kaldi Snapshot
☆31Mar 13, 2013Updated 13 years ago
mpsilfve / phonembedding
View on GitHub
☆14Dec 7, 2018Updated 7 years ago
xk-wang / MusicYOLO
View on GitHub
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆11Jan 29, 2022Updated 4 years ago
olix20 / google_keyword_detection_challenge
View on GitHub
https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/
☆21Mar 1, 2018Updated 8 years ago
thu-coai / TaiLr
View on GitHub
ICLR2023 - Tailoring Language Generation Models under Total Variation Distance
☆21Feb 8, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mguner / audio_search
View on GitHub
Use speech_to_text for keyword search in audio files.
☆12May 5, 2021Updated 5 years ago
carlfm01 / Tacotron-2
View on GitHub
DeepMind's Tacotron-2 Tensorflow implementation
☆16Oct 3, 2021Updated 4 years ago
Sytronik / deep-griffinlim-iteration
View on GitHub
PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)
☆39Oct 12, 2019Updated 6 years ago
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
SmoothKen / KKNet
View on GitHub
An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023
☆23Jan 16, 2024Updated 2 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
averkij / Word-to-Number-Russian
View on GitHub
Проект для перевода чисел, записанных в текстовом виде на русском языке.
☆11Apr 5, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lifeiteng / Aligner-SUPERB
View on GitHub
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆39May 7, 2025Updated last year
stefan-baumann / inceptionkeynet
View on GitHub
A CNN model for key estimation in music recordings
☆19Aug 2, 2023Updated 2 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
jtkim-kaist / end-point-detection
View on GitHub
☆10Sep 19, 2018Updated 7 years ago
Shb742 / rnnoise_python
View on GitHub
python wrapper for rnnoise library
☆48Jan 5, 2023Updated 3 years ago
wangkenpu / Adaptation-Interspeech18
View on GitHub
Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model
☆13Nov 25, 2019Updated 6 years ago
HawkAaron / E2E-ASR
View on GitHub
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127Jun 10, 2019Updated 7 years ago
hosackm / BiquadFilter
View on GitHub
Biquad Filter implementation in C using Portaudio
☆19Aug 20, 2024Updated last year
MlWoo / LPCNet
View on GitHub
Efficient neural speech synthesis
☆81Nov 25, 2020Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
touhi99 / N-gram-Language-model
View on GitHub
Programming for NLP Project - Implement a basic n-gram language model and generate sentence using beam search
☆11Mar 10, 2020Updated 6 years ago
manishmalik / Voice-Classification
View on GitHub
Gender Classification from voice
☆10Apr 27, 2015Updated 11 years ago
mravanelli / pytorch_MLP_for_ASR
View on GitHub
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆40Feb 10, 2018Updated 8 years ago
openfeedback / superhf
View on GitHub
Open-source Human Feedback Library
☆11Oct 25, 2023Updated 2 years ago
xflash96 / query_completion
View on GitHub
☆18Apr 25, 2019Updated 7 years ago
sainathadapa / mediaeval-2019-moodtheme-detection
View on GitHub
4th position solution to the MediaEval - The 2019 Emotion and Themes in Music using Jamendo
☆15Nov 13, 2019Updated 6 years ago
genzen2103 / Emotion-Detection-in-speech-using-Acoustic-and-Neural-Features
View on GitHub
System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM ba…
☆10Nov 15, 2017Updated 8 years ago