bootphon/features_extraction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bootphon/features_extraction)

bootphon / features_extraction

audio cfeatures extraction tool from wav to h5features format

☆19

Alternatives and similar repositories for features_extraction

Users that are interested in features_extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bootphon / ABXpy
View on GitHub
ABX discrimination task in python
☆45Oct 7, 2024Updated last year
G-Wang / Text2Speech-Pytorch
View on GitHub
A Text2Speech Engine built in Pytorch.
☆12Dec 9, 2018Updated 7 years ago
omangin / multimodal
View on GitHub
A set of tools and experimental scripts used to achieve multimodal learning with nonnegative matrix factorization (NMF).
☆18Jul 22, 2016Updated 9 years ago
oliviaguest / compcog.science
View on GitHub
http://compcog.science
☆13Mar 30, 2025Updated last year
ivmartel / dwv-orthanc-plugin
View on GitHub
Orthanc plugin for dwv.
☆16Jan 13, 2019Updated 7 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
PengdaLiu / LAS-SpeechRecognition
View on GitHub
Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).
☆32Jun 27, 2019Updated 7 years ago
bootphon / wordseg
View on GitHub
A Python toolbox for text based word segmentation
☆19Jan 27, 2021Updated 5 years ago
UFAL-DSG / pyfst
View on GitHub
A Python interface to OpenFst
☆14Jun 4, 2019Updated 7 years ago
bootphon / abnet3
View on GitHub
Siamese network for unsupervised speech representation learning
☆11Oct 12, 2018Updated 7 years ago
r9y9 / MelGeneralizedCepstrums.jl
View on GitHub
Mel-Generalized Cepstrum analysis
☆19Jul 21, 2017Updated 8 years ago
fedderrico / ubm_map_diarization
View on GitHub
Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago
rupakvignesh / Singing-Voice-Detection
View on GitHub
Term Project at GTCMT exploring phase based features for Singing Voice Detection with Neural Networks
☆11Apr 20, 2018Updated 8 years ago
Charleswyt / MP3Stego
View on GitHub
A forked opensource stego tool, primary URL: http://www.petitcolas.net/steganography/mp3stego/
☆10Dec 5, 2018Updated 7 years ago
emited / gantk2
View on GitHub
GAN(TK)²: GAN Neural Tangent Kernel ToolKit
☆13Jul 12, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mmadsen / lnraw
View on GitHub
Lab Notebook - Source Version
☆34Dec 8, 2020Updated 5 years ago
david-ryan-snyder / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆10Aug 22, 2019Updated 6 years ago
zhepeiw / cssl_sound
View on GitHub
☆14Jan 17, 2023Updated 3 years ago
tobiasfshr / gmm-ubm-speaker-identification-verification
View on GitHub
Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…
☆21Mar 1, 2018Updated 8 years ago
JasonSWFu / JD-NMF
View on GitHub
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
☆22Oct 14, 2017Updated 8 years ago
caippx / AzurePortalPCE
View on GitHub
☆12Jul 22, 2021Updated 4 years ago
edezhic / fashion-generator
View on GitHub
In-browser GPU-accelerated Generative Adversarial Network trained on Fashion-MNIST dataset (tensorflow + deeplearn.js)
☆11Aug 28, 2018Updated 7 years ago
255BITS / vocal-autoencoder
View on GitHub
☆12May 12, 2016Updated 10 years ago
IoSR-Surrey / IoSR_ListeningRoom_BRIRs
View on GitHub
The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…
☆22Mar 24, 2017Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rronan / IntPhys-Baselines
View on GitHub
Code for paper "IntPhys: A Benchmark and Dataset for Intuitive Physics".
☆30Nov 18, 2019Updated 6 years ago
RajsimmanRavi / UBA_OSSEC
View on GitHub
User Behavior Analysis using OSSEC on cloud infrastructures
☆10Feb 27, 2017Updated 9 years ago
mravanelli / theano-kaldi-rnn
View on GitHub
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c…
☆34Apr 15, 2018Updated 8 years ago
TUT-ARG / DCASE2016-baseline-system-python
View on GitHub
DCASE 2016 Baseline system, python implementation
☆53Jul 20, 2017Updated 9 years ago
messiaen / full-lattice-search
View on GitHub
Full Text Search Over Probabilistic Lattices with Elasticsearch!
☆10Nov 20, 2020Updated 5 years ago
open-speech / tf_kaldi_io
View on GitHub
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
☆40Nov 26, 2018Updated 7 years ago
ansleliu / ConvolutionaNeuralNetworksToEnhanceCodedSpeech
View on GitHub
In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…
☆28Mar 8, 2020Updated 6 years ago
david-ds / adventofcode-2020
View on GitHub
Solutions for https://adventofcode.com/2020
☆12Nov 3, 2024Updated last year
posenhuang / singingvoiceseparationrpca
View on GitHub
Singing-Voice Separation From Monaural Recordings Using Robust Principal Component Analysis
☆66Nov 26, 2020Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
swshon / multi-speakerID
View on GitHub
☆30Nov 9, 2018Updated 7 years ago
morelen17 / tts-papers
View on GitHub
List of papers about TTS / Список статей о TTS
☆10Dec 16, 2017Updated 8 years ago
zweiein / End_to_end_Speech_Papers
View on GitHub
☆13Sep 12, 2017Updated 8 years ago
trneedham / QuantizedGromovWasserstein
View on GitHub
Scalable framework for comparing metric measure spaces with up to 1M points.
☆16Apr 6, 2021Updated 5 years ago
GavinGY / OneThing
View on GitHub
2019
☆11Aug 11, 2018Updated 7 years ago
bolin-chen / audio-steganalysis-cnn
View on GitHub
An audio steganalysis method based on CNN in the time domain.
☆12Feb 25, 2021Updated 5 years ago
danieltan07 / tensorflow-cycle-gan
View on GitHub
This is my attempt at recreating the CycleGAN paper: https://arxiv.org/pdf/1703.10593.pdf
☆12Apr 13, 2017Updated 9 years ago