tyiannak/deep_audio_features

Pytorch implementation of deep audio embedding calculation

☆106

Alternatives and similar repositories for deep_audio_features

Users who are interested in deep_audio_features are comparing it to the libraries listed below.

tyiannak / readys
A Speech Analytics Python Tool for Speaking Assessment
☆13 · Dec 8, 2022 · Updated 3 years ago
tyiannak / paura
Python AUdio Recording and Analysis (paura)
☆226 · Jul 6, 2023 · Updated 3 years ago
tyiannak / pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
☆6,255 · Aug 4, 2025 · Updated 11 months ago
tyiannak / multimodal_movie_analysis
A Python Library for Multimodal Analysis of Movies and Content-based Movie Recommendation
☆31 · Jan 18, 2022 · Updated 4 years ago
tyiannak / basic_audio_analysis
☆37 · Nov 19, 2020 · Updated 5 years ago
trumpepsteinaudio / trumpepsteinaudio
☆16 · Sep 28, 2020 · Updated 5 years ago
pxaris / lyra-dataset
Lyra - A Dataset for Greek Traditional and Folk Music
☆23 · Jun 27, 2023 · Updated 3 years ago
tyiannak / multimodalAnalysis
Python examples for the course "Multimodal Information Processing & Analysis" of the MSc in Data Science in NCSR Demokritos
☆98 · Jul 6, 2023 · Updated 3 years ago
tyiannak / AUROS
A ROS framework for Audio Analysis
☆12 · Apr 5, 2017 · Updated 9 years ago
tyiannak / basic_audio_handling
A set of examples for basic audio data handling
☆13 · Aug 15, 2020 · Updated 5 years ago
theopsall / multiSmote
A multi-label approach of the SMOTE algorithm
☆12 · Aug 6, 2024 · Updated last year
wayne391 / sf_segmenter
Music segmentation algorithm, based on SF (structural feature)
☆57 · Feb 8, 2023 · Updated 3 years ago
tyiannak / pyTextClassification
Training and using classifiers for textual documents
☆15 · Sep 16, 2016 · Updated 9 years ago
bastibe / python-oscillator
See what your sound card is doing in real time
☆14 · Jun 13, 2016 · Updated 10 years ago
aisu-programming / Chord-Recognition
Recognize chords in songs using Bidirectional Transformer || AI Cup 2020 - Chord Recognition Competition (9th place) / 和弦辨識競賽 (第九名)
☆20 · Apr 4, 2024 · Updated 2 years ago
tyiannak / color_your_music_mood
A realtime demo for generating colors based on musical moods
☆50 · Jul 6, 2023 · Updated 3 years ago
nhattruongpham / mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15 · Jan 23, 2024 · Updated 2 years ago
christofw / pitchclass_mctc
Pytorch project accompanying the paper "Training Deep Pitch-Class Representations With a Multi-Label CTC Loss", ISMIR 2021.
☆26 · Mar 24, 2022 · Updated 4 years ago
tyiannak / python-data-science
Introduction to Python for Data Science
☆13 · Oct 11, 2024 · Updated last year
gulnazaki / meowify
A web app and flask server to turn vocals from any youtube song to meows!
☆13 · Jan 8, 2021 · Updated 5 years ago
janclemenslab / das_unsupervised
Deep Audio Segmenter, unsupervised
☆10 · Feb 20, 2026 · Updated 5 months ago
cgaroufis / MSCOL_SMC23
Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in S…
☆17 · Nov 13, 2023 · Updated 2 years ago
theopsall / Video-Summarization
Multimodal summarization of user-generated videos from wearable cameras
☆23 · Jun 22, 2025 · Updated last year
deeuu / loudness
Audio library for modelling loudness
☆40 · Aug 8, 2019 · Updated 6 years ago
MTG / compIAM
Common tools for the computational analysis of Indian Art Music
☆36 · Feb 19, 2026 · Updated 5 months ago
Nikolay-Lysenko / geniartor
Generation of musical phrases that receive maximum score according to configurable evaluational criteria.
☆12 · Oct 17, 2023 · Updated 2 years ago
david-gimeno / tailored-avsr
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15 · Feb 24, 2025 · Updated last year
YashNita / sound-event-detection-winning-method
☆27 · Apr 5, 2019 · Updated 7 years ago
01tot10 / neural-tape-modeling
Neural Modeling of Magnetic Tape Recorders
☆27 · Oct 28, 2023 · Updated 2 years ago
prajwalkr / transpotter
Official implementation of Transpotter, published in BMVC 2021
☆16 · Aug 6, 2022 · Updated 3 years ago
ropensci / ohun
Automatic detection of acoustic signals
☆18 · Oct 30, 2025 · Updated 8 months ago
Nikolay-Lysenko / dodecaphony
Algorithmic composition of modern classical music in the twelve-tone technique.
☆13 · May 10, 2025 · Updated last year
llxlr / Speech-Recognition-With-Python
Speech Recognition With Python | python语音识别
☆21 · Jul 22, 2022 · Updated 4 years ago
Kikyo-16 / A-unified-model-for-zero-shot-musical-source-separation-transcription-and-synthesis
☆40 · Sep 28, 2022 · Updated 3 years ago
sungnyun / avsr-temporal-dynamics
(SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition
☆13 · Oct 22, 2024 · Updated last year
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,162 · Nov 24, 2025 · Updated 8 months ago
micah5 / pyAudioClassification
🎶 dead simple audio classification
☆138 · Nov 14, 2019 · Updated 6 years ago
marathomas / tutorial_repo
Tutorial for generating and evaluating latent-space representations of vocalizations using UMAP
☆15 · May 1, 2022 · Updated 4 years ago
cegeme / iracema
☆16 · Mar 25, 2023 · Updated 3 years ago