AppleHolic/audioset_augmentor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AppleHolic/audioset_augmentor)

AppleHolic / audioset_augmentor

Sound augmentation using Large-scale audio dataset (Audioset)

☆45

Alternatives and similar repositories for audioset_augmentor

Users that are interested in audioset_augmentor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
t13m / kaldi-readers-for-tensorflow
View on GitHub
readers that enable reading kaldi ark in tensorflow
☆17Mar 7, 2018Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AppleHolic / pytorch_sound
View on GitHub
Sound Related Deep Learning Tasks boosting repository with pytorch
☆88Jul 25, 2024Updated 2 years ago
IBM / audioset-classification
View on GitHub
Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning
☆102Sep 17, 2025Updated 10 months ago
alumae / streaming-punctuator
View on GitHub
☆17Apr 14, 2023Updated 3 years ago
yc9701 / pansori
View on GitHub
Tools for ASR Corpus Generation from Online Video
☆140Feb 10, 2019Updated 7 years ago
AppleHolic / source_separation
View on GitHub
Deep learning based speech source separation using Pytorch
☆319Nov 20, 2020Updated 5 years ago
gooofy / py-nltools
View on GitHub
A collection of basic python modules for spoken natural language processing
☆55Dec 1, 2019Updated 6 years ago
jim-schwoebel / download_audioset
View on GitHub
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
☆106Aug 1, 2023Updated 2 years ago
athena-team / DiDiSpeech
View on GitHub
☆45Oct 24, 2020Updated 5 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
rafaelvalle / asrgen
View on GitHub
Attacking Speaker Recognition with Deep Generative Models
☆34Mar 24, 2023Updated 3 years ago
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
NickRuiz / power-asr
View on GitHub
Phonetically-Oriented Word Error Rate
☆36May 4, 2019Updated 7 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
cornerfarmer / ctc_segmentation
View on GitHub
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75Oct 9, 2020Updated 5 years ago
csteinmetz1 / MixCNN
View on GitHub
Convolutional Neural Network for multitrack mix leveling
☆19Jun 25, 2018Updated 8 years ago
qiuqiangkong / audioset_source_separation
View on GitHub
☆17Feb 14, 2020Updated 6 years ago
tqbl / ood_audio
View on GitHub
An audio classification system for learning with out-of-distribution data
☆33Dec 8, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nkrnrnk / BertPunc
View on GitHub
SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model
☆182May 17, 2019Updated 7 years ago
keunwoochoi / ismir-2019-posters
View on GitHub
☆75Jan 6, 2020Updated 6 years ago
qiuqiangkong / audioset_classification
View on GitHub
☆229Feb 9, 2020Updated 6 years ago
ZhihaoDU / speech_feature_extractor
View on GitHub
Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…
☆129Aug 12, 2020Updated 5 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
google-research-datasets / uninum
View on GitHub
A database of number names for 186 languages, locales, and scripts
☆67Mar 3, 2023Updated 3 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
jim-schwoebel / voiceome
View on GitHub
🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…
☆32Apr 2, 2025Updated last year
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yinruiqing / change_detection
View on GitHub
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆67Jul 14, 2020Updated 6 years ago
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago
jinserk / pytorch-asr
View on GitHub
ASR with PyTorch
☆139Mar 10, 2019Updated 7 years ago
thu-ml / LM-Calibration
View on GitHub
☆17May 31, 2023Updated 3 years ago
lezasantaizi / audio_cut
View on GitHub
语音切割，python ，webrtc
☆11Sep 28, 2018Updated 7 years ago
RicherMans / GPV
View on GitHub
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆141Aug 3, 2023Updated 2 years ago
snsun / pit-speech-separation
View on GitHub
☆131Aug 9, 2018Updated 7 years ago