IBM / MAX-Audio-Embedding-GeneratorLinks
Generate embedding vectors from audio files
☆59Updated 4 months ago
Alternatives and similar repositories for MAX-Audio-Embedding-Generator
Users that are interested in MAX-Audio-Embedding-Generator are comparing it to the libraries listed below
Sorting:
- Python library for audio augmentation☆85Updated 2 years ago
- Wrapper for pydub AudioSegment objects☆96Updated 3 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.☆56Updated 4 years ago
- Identify sounds in short audio clips☆156Updated 4 months ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Updated last year
- Train a Deep Learning model to classify audio embeddings on IBM's Deep Learning as a Service (DLaaS) platform - Watson Machine Learning☆102Updated 4 months ago
- A simple audio feature extraction library☆81Updated 6 years ago
- Audio preprocessing framework for Deep Learning audio applications☆129Updated 3 years ago
- python wrapper for rubberband☆211Updated last year
- ☆61Updated 2 years ago
- Real-time Audio time-scale and pitch modification in Python☆60Updated 6 years ago
- Python library for handling audio datasets.☆138Updated 2 years ago
- An in-browser app for labeling audio clips at random, using Docker and Flask.☆53Updated 8 years ago
- A deep learning framework for Speech-Music discrimination of continuous audio streams☆68Updated 7 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.☆119Updated 3 years ago
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆93Updated 7 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- ☆32Updated 4 years ago
- ☆44Updated last year
- Util code, issues, discussions☆29Updated 7 years ago
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆145Updated 3 years ago
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆143Updated 2 years ago
- 🔉 A web app to play, visualize, and annotate your audio files for machine learning☆120Updated 5 years ago
- A collection of python scripts for extracting and analyzing acoustics from audio files.☆101Updated 2 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Updated 11 years ago
- A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable wo…☆70Updated 8 years ago
- An implementation of the Short Time Fourier Transform in pure TensorFlow☆62Updated 5 years ago
- museval - source separation evaluation tools for python☆231Updated 7 months ago