AppleHolic / audioset_augmentor
Sound augmentation using Large-scale audio dataset (Audioset)
☆45 · Jun 29, 2021 · Updated 4 years ago
Alternatives and similar repositories for audioset_augmentor
Users that are interested in audioset_augmentor are comparing it to the libraries listed below
- BurrMill core ☆22 · Nov 2, 2021 · Updated 4 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … ☆11 · Mar 29, 2021 · Updated 4 years ago
- Compute the most likely permutation of a lattice given an LM ☆10 · Jan 3, 2013 · Updated 13 years ago
- ☆21 · Jan 13, 2020 · Updated 6 years ago
- Perform the forced decoding with target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- speech engine training projects ☆29 · Apr 19, 2021 · Updated 4 years ago
- ☆21 · Sep 24, 2018 · Updated 7 years ago
- Phonetically-Oriented Word Error Rate ☆36 · May 4, 2019 · Updated 6 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine ☆14 · Oct 16, 2017 · Updated 8 years ago
- Convolutional Neural Network for multitrack mix leveling ☆18 · Jun 25, 2018 · Updated 7 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- Phonetic and phonological vocoding platform ☆17 · Nov 23, 2016 · Updated 9 years ago
- Tools for ASR Corpus Generation from Online Video ☆140 · Feb 10, 2019 · Updated 7 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB) ☆31 · May 14, 2024 · Updated last year
- Sound Related Deep Learning Tasks boosting repository with pytorch ☆88 · Jul 25, 2024 · Updated last year
- Interspeech 2019 tutorial materials ☆49 · Sep 26, 2019 · Updated 6 years ago
- Code for ICASSP 2019 paper ☆18 · Oct 29, 2018 · Updated 7 years ago
- Supplemental material for the paper "Towards Automatically Correcting Tapped Beat Annotations for Music Recordings" ☆20 · May 6, 2021 · Updated 4 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation ☆21 · Mar 29, 2019 · Updated 6 years ago
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- Util code, issues, discussions ☆29 · Aug 31, 2018 · Updated 7 years ago
- A collection of basic python modules for spoken natural language processing ☆55 · Dec 1, 2019 · Updated 6 years ago
- Attacking Speaker Recognition with Deep Generative Models ☆34 · Mar 24, 2023 · Updated 2 years ago
- Articulatory features estimation using Listen Attend and Spell architecture. ☆33 · Apr 24, 2020 · Updated 5 years ago
- OCTRA is a web-application for the orthographic transcription of audio files. ☆39 · Jan 28, 2026 · Updated 2 weeks ago
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning ☆102 · Sep 17, 2025 · Updated 4 months ago
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆74 · Oct 9, 2020 · Updated 5 years ago
- ☆76 · Mar 18, 2022 · Updated 3 years ago
- ASR with PyTorch ☆140 · Mar 10, 2019 · Updated 6 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi. ☆18 · Dec 21, 2021 · Updated 4 years ago
- An audio classification system for learning with out-of-distribution data ☆33 · Dec 8, 2022 · Updated 3 years ago
- ☆75 · Jan 6, 2020 · Updated 6 years ago
- Main Melody Extraction with Source-Filter NMF and CRNN ☆25 · Apr 8, 2019 · Updated 6 years ago
- An open-source tool for automatic speech recognition ASR quality estimation. ☆23 · Dec 12, 2019 · Updated 6 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation. ☆12 · Nov 17, 2023 · Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010 ☆12 · Mar 1, 2016 · Updated 9 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder ☆38 · Dec 16, 2024 · Updated last year