AENet: audio feature extraction
☆60Aug 30, 2019Updated 6 years ago
Alternatives and similar repositories for aenet
Users that are interested in aenet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and demos for our paper at ACM MM 2017☆62May 2, 2019Updated 6 years ago
- A dataset with user created GIFs☆48Oct 7, 2018Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016☆101Aug 10, 2017Updated 8 years ago
- Spectral audio feature extraction using time-frequency reassignment☆46Sep 26, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- a music segmentation algorithm that I proposed and implemented as my undergraduate project. The basic function is: a song is loaded to th…☆16Apr 19, 2013Updated 12 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- Documented code with instructions to reproduce results of paper submitted to ECML☆13Oct 11, 2018Updated 7 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Extracts the shot classes and generic visual features for a broadcast news video.☆13Jul 23, 2017Updated 8 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Code for replicating results in 'On Weight Initializations in Deep Neural Networks'☆10Apr 28, 2017Updated 8 years ago
- TensorFlow implementation of "SoundNet".☆145Mar 26, 2018Updated 8 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Sound event detection in real life audio with CNN submitted to DCASE16☆22Jun 10, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- ☆56Oct 30, 2015Updated 10 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 8 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- ☆15Nov 6, 2017Updated 8 years ago
- Reference implementation for Structured Prediction with Deep Value Networks☆54Jul 10, 2017Updated 8 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model☆17Nov 24, 2016Updated 9 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pytorch implementation of Quadratic Additive Angular Margin Loss for Face Recognition☆35Nov 24, 2020Updated 5 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools☆34Aug 20, 2019Updated 6 years ago
- ☆59Dec 13, 2017Updated 8 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- A generative model for Indian classical music using finite state machines☆14Jan 10, 2021Updated 5 years ago
- Keras implementation of the article "Solving internal covariate shift in deep learning with linked neurons"☆13Dec 8, 2017Updated 8 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago