fast SpecAugmentation code with numpy and scipy
☆31Jul 5, 2019Updated 6 years ago
Alternatives and similar repositories for SpecAugment_numpy_scipy
Users that are interested in SpecAugment_numpy_scipy are comparing it to the libraries listed below
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- tf 2.0 implementation of Listen, attend and spell☆21Jan 19, 2021Updated 5 years ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆11Jan 29, 2022Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- ☆14Oct 2, 2017Updated 8 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆16Oct 3, 2021Updated 4 years ago
- ☆14Dec 7, 2018Updated 7 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Sep 3, 2018Updated 7 years ago
- Download and preparation tool for free speech corpora.☆16Apr 28, 2019Updated 6 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆500Jun 11, 2021Updated 4 years ago
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance☆21Feb 8, 2023Updated 3 years ago
- In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses co…☆20Jul 18, 2018Updated 7 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Mar 1, 2018Updated 8 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Jan 16, 2024Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Simple example of how to use TensorFlow's CTC loss with Voxforge speech data☆18Nov 12, 2016Updated 9 years ago
- A hub for ResNet based models and pretrained weights in TensorFlow.☆21Aug 5, 2021Updated 4 years ago
- python wrapper for rnnoise library☆48Jan 5, 2023Updated 3 years ago
- Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…☆26Sep 13, 2020Updated 5 years ago
- Speech Recognition for Uyghur using Speech transformer☆28Jun 19, 2021Updated 4 years ago
- wavenet vocoder using tensorflow☆26Feb 18, 2018Updated 8 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation☆31Jul 6, 2023Updated 2 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Mar 21, 2019Updated 6 years ago
- GaugeMeterView is a view that can be used in different meter applications☆12Feb 25, 2022Updated 4 years ago
- Open-source Vue project: a rebuild of the Vancl (凡客) website, with a complete business flow and a backend data API☆12Jan 15, 2022Updated 4 years ago
- python script for voice activity detection.☆36Aug 16, 2024Updated last year
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Feb 10, 2018Updated 8 years ago
- PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)☆39Oct 12, 2019Updated 6 years ago
- Spell correction language model for Uyghur language based on transformer neural network☆14Jun 18, 2025Updated 8 months ago
- Build ontology view step-by-step on OpenGroup's ArchiMate 3.2 Specification☆14Jan 27, 2026Updated last month
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆10Sep 22, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- A static site generator for SiYuan Note (思源笔记) app☆14Oct 18, 2025Updated 4 months ago
- Speech Recognition and Simple AI Summary: local speech-to-text, speaker diarization, and simple AI summarization, paired with a web-based interface.☆11Jul 22, 2024Updated last year
- extract chords from an audio file (using ohollo/chord-extractor & Chordino)☆12Mar 23, 2025Updated 11 months ago
- Use speech_to_text for keyword search in audio files.☆12May 5, 2021Updated 4 years ago