znaoya/aenet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/znaoya/aenet)

znaoya / aenet

AENet: audio feature extraction

☆60

Alternatives and similar repositories for aenet

Users that are interested in aenet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arunbalajeev / query-video-summary
View on GitHub
Code and demos for our paper at ACM MM 2017
☆62May 2, 2019Updated 7 years ago
gifs / personalized-highlights-dataset
View on GitHub
A dataset with user created GIFs
☆48Oct 7, 2018Updated 7 years ago
gyglim / video2gif_dataset
View on GitHub
The Video2GIF dataset with 100k GIFs from our paper at CVPR2016
☆99Aug 10, 2017Updated 8 years ago
bzamecnik / tfr
View on GitHub
Spectral audio feature extraction using time-frequency reassignment
☆50Sep 26, 2018Updated 7 years ago
torogmw / MusicSegmentation
View on GitHub
a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…
☆16Apr 19, 2013Updated 13 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
ikiskin / ECMLDeepAudio
View on GitHub
Documented code with instructions to reproduce results of paper submitted to ECML
☆13Oct 11, 2018Updated 7 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
batra-mlp-lab / avsd
View on GitHub
[CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog
☆34Feb 1, 2021Updated 5 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago
TUIlmenauAMS / FilterBanks_PythonKerasNeuralNetworkImplemention
View on GitHub
Filter Bank Implementaion as Convolutional Neural Network using Python Keras
☆17Dec 18, 2024Updated last year
sidkk86 / weight_initialization
View on GitHub
Code for replicating results in 'On Weight Initializations in Deep Neural Networks'
☆10Apr 28, 2017Updated 9 years ago
eborboihuc / SoundNet-tensorflow
View on GitHub
TensorFlow implementation of "SoundNet".
☆144Mar 26, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gorinars / dcase16-cnn
View on GitHub
Sound event detection in real life audio with CNN submitted to DCASE16
☆22Jun 10, 2022Updated 4 years ago
aliensunmin / DomainSpecificHighlight
View on GitHub
☆56Oct 30, 2015Updated 10 years ago
burrmill / burrmill
View on GitHub
BurrMill core
☆22Nov 2, 2021Updated 4 years ago
idiap / inv-tn
View on GitHub
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Sep 27, 2017Updated 8 years ago
parakalan / RagaRecognition
View on GitHub
An attempt to recognise raga of a Carnatic song.
☆12Dec 24, 2022Updated 3 years ago
SiggiGue / sigfeat
View on GitHub
Feature Extraction from Signals e.g. for Audio Feature Extraction and Processing.
☆10Aug 21, 2019Updated 6 years ago
gyglim / dvn
View on GitHub
Reference implementation for Structured Prediction with Deep Value Networks
☆54Jul 10, 2017Updated 9 years ago
fgnt / LatticeWordSegmentation
View on GitHub
Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model
☆17Nov 24, 2016Updated 9 years ago
gshruti95 / news-shot-classification
View on GitHub
Extracts the shot classes and generic visual features for a broadcast news video.
☆13Jul 23, 2017Updated 9 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
madvn / Carnatic_FSM
View on GitHub
A generative model for Indian classical music using finite state machines
☆14Jan 10, 2021Updated 5 years ago
qqueing / pytorch-G2P
View on GitHub
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Dec 17, 2017Updated 8 years ago
blauigris / linked_neurons
View on GitHub
Keras implementation of the article "Solving internal covariate shift in deep learning with linked neurons"
☆13Dec 8, 2017Updated 8 years ago
scanner-research / hwang
View on GitHub
Fast sparse video decode
☆33Jan 28, 2020Updated 6 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
bbjornstad / audio-feature-extraction
View on GitHub
A repository holding my personal implementations of audio feature extraction for environmental and musical auditory analysis and classifi…
☆14Dec 2, 2019Updated 6 years ago
bsxfan / meta-embeddings
View on GitHub
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
BathVisArtData / PhotoArt50
View on GitHub
Photos and artwork images with object annotations for academic use only
☆28Oct 25, 2016Updated 9 years ago
sil-ai / tts-singlish
View on GitHub
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11Jan 11, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gyglim / personalized-highlights-dataset
View on GitHub
A dataset with user created GIFs
☆64Oct 4, 2018Updated 7 years ago
dansoutner / LSTMLM
View on GitHub
Simple LSTM language modelling toolkit
☆10Oct 21, 2022Updated 3 years ago
kamperh / globalphone_awe
View on GitHub
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Nov 3, 2020Updated 5 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
homink / speech.ko
View on GitHub
Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language
☆43Feb 28, 2018Updated 8 years ago
Rick-McCoy / ClassicGAN
View on GitHub
ClassicGAN: Generation of Classical Music with PGGAN
☆17Nov 24, 2018Updated 7 years ago
cvondrick / soundnet
View on GitHub
SoundNet: Learning Sound Representations from Unlabeled Video. NIPS 2016
☆466Oct 7, 2017Updated 8 years ago