This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding (CPC), using code provided by jefflai108 that applies CPC to learn representations of sound files for speech recognition.
☆10Jan 25, 2021Updated 5 years ago
Alternatives and similar repositories for unsupervised-speech-representation-learning
Users that are interested in unsupervised-speech-representation-learning are comparing it to the libraries listed below.
- A chatbot using the Vaswani transformer as its sequence-to-sequence module☆22Jul 27, 2023Updated 2 years ago
- Tree visualization of the AudioSet Ontology - https://github.com/audioset/ontology☆18Aug 8, 2024Updated last year
- ☆14Jan 9, 2025Updated last year
- Demonstration of using Caffe2 inside an Android application.☆10Dec 23, 2018Updated 7 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Hyper-AdaC: Adaptive clustering-based hypergraph representation of whole slide images for survival analysis☆16Nov 28, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data☆18Apr 3, 2025Updated 11 months ago
- This repository contains the material for deploying deep learning models on mobile and embedded platforms☆11Jan 28, 2018Updated 8 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Automatic Speech Recognition (ASR) - German☆18Jul 3, 2020Updated 5 years ago
- ☆13Aug 4, 2021Updated 4 years ago
- Automatic Speech Recognition (ASR) - German☆22Aug 26, 2019Updated 6 years ago
- ☆21Apr 6, 2021Updated 4 years ago
- deeplearning.ai is the complete course on Deep Learning on Coursera. The instructor of this course is Andrew Ng. Programming assignments…☆12Jul 6, 2018Updated 7 years ago
- An application that helps in generating TMorph codes for WoW.☆10Mar 10, 2016Updated 10 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆17Apr 17, 2025Updated 11 months ago
- Some of my microboard projects.☆13Nov 30, 2017Updated 8 years ago
- PyTorch implementation for Meta-SPL (self-paced learning).☆18Jul 8, 2020Updated 5 years ago
- The Dialogflow Fulfillment Builder is a library that helps you to build the responses with ease in order to connect your Dialogflow agent…☆11Mar 5, 2023Updated 3 years ago
- Boilerplate to bridge the absence of a framework and support Dialogflow Fulfillment implementation for multiple platforms by building a W…☆11Mar 8, 2022Updated 4 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Introductory Notebooks on Machine Learning topics.☆34Oct 27, 2025Updated 4 months ago
- ☆30Mar 15, 2024Updated 2 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in PyTorch☆15Aug 12, 2017Updated 8 years ago
- An implementation of Neural Style Transfer for Audio using PyTorch.☆10Dec 14, 2017Updated 8 years ago
- pYIN pitch detection implementation with librosa and python 3☆14Jul 16, 2019Updated 6 years ago
- Github mirror of MediaWiki extension Cargo - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_…☆34Updated this week
- ☆14Apr 29, 2015Updated 10 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Iteratively Coupled Multiple Instance Learning☆24Nov 28, 2024Updated last year
- [ICML2025] An open source multi-modal codebase, Official code for "Towards the Causal Complete Cause of Multi-Modal Representation Learni…☆49Sep 28, 2025Updated 5 months ago
- A Rideshare Simulation built in C++, using OpenStreetMap data☆14Oct 24, 2021Updated 4 years ago
- This script converts arxiv papers into a certain markdown format.☆18May 19, 2023Updated 2 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks.☆39Sep 30, 2020Updated 5 years ago