This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding, using code provided by jefflai108 that applies CPC to learn representations of sound files for speech recognition.
☆10Jan 25, 2021Updated 5 years ago
Alternatives and similar repositories for unsupervised-speech-representation-learning
Users that are interested in unsupervised-speech-representation-learning are comparing it to the libraries listed below.
- A chatbot using the Vaswani transformer as its sequence-to-sequence module☆22Jul 27, 2023Updated 2 years ago
- Tree visualization of the AudioSet Ontology - https://github.com/audioset/ontology☆18Aug 8, 2024Updated last year
- ☆14Jan 9, 2025Updated last year
- Demonstration of using Caffe2 inside an Android application.☆10Dec 23, 2018Updated 7 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Hyper-AdaC: Adaptive clustering-based hypergraph representation of whole slide images for survival analysis☆16Nov 28, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- [MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data☆18Apr 3, 2025Updated last year
- This repository contains the material for deploying deep learning models on mobile and embedded platforms☆11Jan 28, 2018Updated 8 years ago
- ☆12Jun 5, 2018Updated 7 years ago
- Automatic Speech Recognition (ASR) - German☆18Jul 3, 2020Updated 5 years ago
- ☆13Aug 4, 2021Updated 4 years ago
- ☆21Apr 6, 2021Updated 5 years ago
- An application that helps in generating TMorph codes for WoW.☆10Mar 10, 2016Updated 10 years ago
- deeplearning.ai is the complete course on Deep Learning on Coursera. The instructor of this course is Andrew Ng. Programming assignments…☆12Jul 6, 2018Updated 7 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Automatic Speech Recognition (ASR) - German☆23Aug 26, 2019Updated 6 years ago
- ☆17Apr 17, 2025Updated 11 months ago
- Some of my microboard projects.☆13Nov 30, 2017Updated 8 years ago
- The Dialogflow Fulfillment Builder is a library that helps you to build the responses with ease in order to connect your Dialogflow agent…☆11Mar 5, 2023Updated 3 years ago
- Boilerplate to bridge the absence of a framework and support Dialogflow Fulfillment implementation for multiple platforms by building a W…☆11Mar 8, 2022Updated 4 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆32Jun 19, 2022Updated 3 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Introductory Notebooks on Machine Learning topics.☆34Oct 27, 2025Updated 5 months ago
- ☆30Mar 15, 2024Updated 2 years ago
- PyTorch implementation for Meta-SPL (self-paced learning).☆18Jul 8, 2020Updated 5 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in PyTorch☆15Aug 12, 2017Updated 8 years ago
- An implementation of Neural Style Transfer for Audio using PyTorch.☆11Dec 14, 2017Updated 8 years ago
- pYIN pitch detection implementation with librosa and python 3☆14Jul 16, 2019Updated 6 years ago
- Github mirror of MediaWiki extension Cargo - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_…☆34Updated this week
- ☆14Apr 29, 2015Updated 10 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- Math24o: High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- Iteratively Coupled Multiple Instance Learning☆24Nov 28, 2024Updated last year
- This script converts arxiv papers into a certain markdown format.☆18May 19, 2023Updated 2 years ago
- [ICML2025] An open source multi-modal codebase, Official code for "Towards the Causal Complete Cause of Multi-Modal Representation Learni…☆63Sep 28, 2025Updated 6 months ago
- A Rideshare Simulation built in C++, using OpenStreetMap data☆14Oct 24, 2021Updated 4 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago