PyTorch implementation of deep audio embedding calculation
☆107Jul 23, 2023Updated 2 years ago
Alternatives and similar repositories for deep_audio_features
Users who are interested in deep_audio_features are comparing it to the libraries listed below
- Python AUdio Recording and Analysis (paura)☆227Jul 6, 2023Updated 2 years ago
- A Speech Analytics Python Tool for Speaking Assessment☆14Dec 8, 2022Updated 3 years ago
- PyTorch project accompanying the paper "Training Deep Pitch-Class Representations With a Multi-Label CTC Loss", ISMIR 2021.☆25Mar 24, 2022Updated 3 years ago
- Towards Understanding Deep Learning Representations via Interactive Experimentation☆24May 5, 2017Updated 8 years ago
- ☆36Nov 19, 2020Updated 5 years ago
- Music segmentation algorithm, based on SF (structural feature)☆57Feb 8, 2023Updated 3 years ago
- A demo of using @magenta/music as a dev-dependency in a TypeScript project☆22Jan 7, 2023Updated 3 years ago
- Timedomain-Ai-Singer Omniverse plugin☆20Feb 15, 2023Updated 3 years ago
- Audio library for modelling loudness☆40Aug 8, 2019Updated 6 years ago
- A demo using Soundfonts to play music within JUCE.☆12Nov 1, 2023Updated 2 years ago
- ☆40Sep 28, 2022Updated 3 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Tools to convert sigsep mus dataset from STEMS <-> WAV☆11Jul 15, 2020Updated 5 years ago
- Tools for use with the Macaque Face 3D stimulus set - a parameterized digital 3D model of the Rhesus macaque face☆11Apr 27, 2022Updated 3 years ago
- Deep Audio Segmenter, unsupervised☆10Feb 20, 2026Updated 2 weeks ago
- GSoC 2019: Development of a Tool for Extracting Quantitative Text Profiles☆11Jul 7, 2020Updated 5 years ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- Generation of musical phrases that receive the maximum score according to configurable evaluation criteria.☆12Oct 17, 2023Updated 2 years ago
- Common tools for the computational analysis of Indian Art Music☆33Feb 19, 2026Updated 2 weeks ago
- Paper Name: Complex Convolution Neural Network model (Complex DeepLab v3) on STFT time-varying frequency components for audio denoising …☆12Dec 22, 2022Updated 3 years ago
- ☆11Dec 22, 2020Updated 5 years ago
- Conditional Similarity Networks (CSNs-Tensorflow)☆10Oct 29, 2018Updated 7 years ago
- Utils and modules for speech, language, and multimodal processing using PyTorch and PyTorch Lightning☆22Feb 16, 2023Updated 3 years ago
- Tools to run experiments around large scale cover detection.☆28Sep 30, 2022Updated 3 years ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- Bayesian Statistics Guide☆17Jan 9, 2022Updated 4 years ago
- Algorithmic composition of modern classical music in the twelve-tone technique.☆13May 10, 2025Updated 9 months ago
- Tutorial for generating and evaluating latent-space representations of vocalizations using UMAP☆14May 1, 2022Updated 3 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Apr 10, 2014Updated 11 years ago
- Audio Keyword Search☆12May 5, 2019Updated 6 years ago
- User-defined keyword tags for beets☆19Mar 11, 2024Updated last year
- A Python Tool for Analysis of Mouse Vocal Communication☆17Mar 6, 2024Updated 2 years ago
- A re-implementation of the Wavelets package using Cython to improve the speed.☆13Jan 17, 2021Updated 5 years ago
- Music structure segmentation based on shift-invariant probabilistic latent component analysis of chroma☆42Oct 28, 2010Updated 15 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Feb 16, 2024Updated 2 years ago
- Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included…☆40Sep 5, 2022Updated 3 years ago
- SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings☆15Jan 23, 2024Updated 2 years ago
- This project aims to estimate the tempo (in BPM or beats per minute), the locations of the beats and downbeats of a song in the genre of …☆15Jan 24, 2018Updated 8 years ago
- ☆16Sep 28, 2020Updated 5 years ago