Code to demonstrate multimodal LSTM
☆36Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for lstm_speaker_naming_aaai16
Users that are interested in lstm_speaker_naming_aaai16 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Vectorized multimodal LSTM using Matlab and GPU☆32Apr 19, 2016Updated 10 years ago
- Recurrent Neural Network Demo by PyBrain☆10Feb 2, 2015Updated 11 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- MATLAB functions that interface with the HTK Speech Recognition Toolkit (http://htk.eng.cam.ac.uk/) for training HMMs, GMMs and simple sp…☆46Jan 4, 2017Updated 9 years ago
- Code for UAI 2019 paper "Domain Generalization via Multidomain Discriminant Analysis"☆14Aug 28, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- C++ library for neural networks.☆39Feb 4, 2016Updated 10 years ago
- For FFL Blog☆10Sep 24, 2015Updated 10 years ago
- Modular Restricted Boltzmann Machine (RBM) implementation using Theano☆174Feb 21, 2013Updated 13 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- Singing-Voice Separation From Monaural Recordings Using Robust Principal Component Analysis☆67Nov 26, 2020Updated 5 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Visibility graphs for robust harmonic similarity measures between audio spectra☆15Apr 29, 2020Updated 6 years ago
- 基于html5实现的视频播放器,提供PSD源文件☆47Sep 23, 2015Updated 10 years ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Aug 20, 2024Updated last year
- A neural network for end-to-end music source separation☆24Oct 31, 2018Updated 7 years ago
- Implementation of the ConvS2S architecture using TensorFlow. Also includes the BiConvS2S for bidirectional sequence-to-sequence generatio…☆10May 14, 2019Updated 6 years ago
- ☆29May 22, 2015Updated 10 years ago
- ☆10Mar 4, 2016Updated 10 years ago
- ☆22Nov 19, 2018Updated 7 years ago
- Face detection using Multi-scale Block Local Binary Pattern algorithm - optimized with OpenCL/OpenMP - Depreciated - pls use convolutiona…☆11Jul 16, 2017Updated 8 years ago
- Pythonic access to audio files☆60Dec 4, 2024Updated last year
- Look Ahead Hamiltonian Monte Carlo☆31Mar 29, 2015Updated 11 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The implementation of Word2Vec (SkipGram - and CBOW) models using theano and numpy☆27Jun 3, 2016Updated 9 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 6 months ago
- Rhythm Pattern music feature extractor by IFS @ TU-Vienna☆115Sep 11, 2018Updated 7 years ago
- Python toolkit for likelihood-ratio calibration of binary classifiers☆25Feb 21, 2023Updated 3 years ago
- ☆13Sep 16, 2016Updated 9 years ago
- Content Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)☆55Apr 18, 2019Updated 7 years ago
- This is a Javascript toolbox to perform online rating studies with auditory material.☆18Nov 18, 2024Updated last year
- MXNet finetune baseline (res152) for challenger.ai/competition/scene☆11Sep 24, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- tensorflow implementation of 'Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer'☆35Jul 31, 2017Updated 8 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆20Jul 17, 2023Updated 2 years ago
- codes for: Modality to Modality Translation: An Adversarial Representation Learning and Graph Fusion Network for Multimodal Fusion☆48Sep 1, 2021Updated 4 years ago
- A web application that recommends songs via "country arithmetic" and hand-rolled Implicit Matrix Factorization☆10May 5, 2017Updated 8 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 5 years ago