clam004/unsupervised-speech-representation-learning

This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding (CPC), using code provided by jefflai108 that applies CPC to learn representations of sound files for speech recognition.

☆10

Alternatives and similar repositories for unsupervised-speech-representation-learning

Users who are interested in unsupervised-speech-representation-learning are comparing it to the repositories listed below.

clam004 / chat-transformer
A chatbot using the Vaswani transformer as its sequence-to-sequence module
☆22 · Jul 27, 2023 · Updated 2 years ago
jordipons / AudioSetOntologyTree
Tree visualization of the AudioSet Ontology - https://github.com/audioset/ontology
☆18 · Aug 8, 2024 · Updated last year
warmestwind / ABHG
☆14 · Jan 9, 2025 · Updated last year
soumith / AICamera
Demonstration of using Caffe2 inside an Android application.
☆10 · Dec 23, 2018 · Updated 7 years ago
samsad35 / source-filter-vae
[SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder
☆46 · Apr 18, 2023 · Updated 3 years ago
HakimBenkirane / Hyper-adaC
Hyper-AdaC: Adaptive clustering-based hypergraph representation of whole slide images for survival analysis
☆16 · Nov 28, 2022 · Updated 3 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
imohitch / DeepLearningOnAndroid
This repository contains the material for deploying deep learning models on mobile and embedded platforms
☆11 · Jan 28, 2018 · Updated 8 years ago
AjeetGitHub2016 / deeplearning.ai
deeplearning.ai is the complete course on Deep Learning on Coursera. The instructor of this course is Andrew Ng. Programming assignments…
☆12 · Jul 6, 2018 · Updated 8 years ago
Quantiphi / webhook-boilerplate
Boilerplate to bridge the absence of a framework and support Dialogflow Fulfillment implementation for multiple platforms by building a W…
☆10 · Mar 8, 2022 · Updated 4 years ago
DanBmh / deepspeech-german
Automatic Speech Recognition (ASR) - German
☆18 · Jul 3, 2020 · Updated 6 years ago
karndeb / NLP-Service
☆13 · Aug 4, 2021 · Updated 4 years ago
jongwook / crepe
☆12 · Jun 5, 2018 · Updated 8 years ago
msh9184 / contrastive-equilibrium-learning
☆21 · Apr 6, 2021 · Updated 5 years ago
skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13 · Nov 18, 2022 · Updated 3 years ago
AASHISHAG / asr-german
Automatic Speech Recognition (ASR) - German
☆23 · Aug 26, 2019 · Updated 6 years ago
kdar / morphgen
An application that helps in generating TMorph codes for WoW.
☆10 · Mar 10, 2016 · Updated 10 years ago
ML4DS / ML4all
Introductory Notebooks on Machine Learning topics.
☆34 · Oct 27, 2025 · Updated 8 months ago
warmestwind / RAPNet
☆17 · Apr 17, 2025 · Updated last year
gorgitko / microboard-projects
Some of my microboard projects.
☆13 · Nov 30, 2017 · Updated 8 years ago
Quantiphi / dialogflow-fulfillment-builder
The Dialogflow Fulfillment Builder is a library that helps you to build the responses with ease in order to connect your Dialogflow agent…
☆11 · Mar 5, 2023 · Updated 3 years ago
lucidrains / tranception-pytorch
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32 · Jun 19, 2022 · Updated 4 years ago
GeWanying / shap-anti-spoofing
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12 · Jan 24, 2024 · Updated 2 years ago
xjtushujun / Meta-SPL
Pytorch implementation for Meta-SPL (self-paced learning).
☆19 · Jul 8, 2020 · Updated 6 years ago
rguthrie3 / DeepDependencyParsingProblemSet
A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch
☆15 · Aug 12, 2017 · Updated 8 years ago
Lucas-rbnt / DRIM
[MICCAI 2024] DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data
☆20 · Apr 3, 2025 · Updated last year
Dootmaan / ICMIL
Iteratively Coupled Multiple Instance Learning
☆22 · Nov 28, 2024 · Updated last year
muhdhuz / Audio_NeuralStyle
An implementation of Neural Style Transfer for Audio using Pytorch.
☆11 · Dec 14, 2017 · Updated 8 years ago
wikimedia / mediawiki-extensions-Cargo
Github mirror of MediaWiki extension Cargo - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_…
☆35 · Updated this week
Anaesthesiaye / sound_event_detection_transformer
Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)
☆46 · May 9, 2022 · Updated 4 years ago
xiaoch2004 / librosa_py3_pYIN
pYIN pitch detection implementation with librosa and python 3
☆14 · Jul 16, 2019 · Updated 7 years ago
bedapudi6788 / txt2txt
Extremely easy-to-use sequence-to-sequence library with attention, for text-to-text conversion tasks.
☆39 · Sep 30, 2020 · Updated 5 years ago
kamya-ai / Realtime-speech-detection
Welcome to the Real-Time Voice Activity Detection (VAD) program, powered by Silero-VAD model! 🚀 This program allows you to perform live …
☆12 · Jul 9, 2023 · Updated 3 years ago
noiseux1523 / NIST-SRE-2019
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10 · Nov 8, 2019 · Updated 6 years ago
david-gimeno / tailored-avsr
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆15 · Feb 24, 2025 · Updated last year
prosodylab / prosodylab.alignertools
☆14 · Apr 29, 2015 · Updated 11 years ago
mvirgo / Rideshare-Simulation
A Rideshare Simulation built in C++, using OpenStreetMap data
☆14 · Oct 24, 2021 · Updated 4 years ago
AASHISHAG / DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model in the web browser.
☆32 · Jan 4, 2023 · Updated 3 years ago
Sreyan88 / RECAP
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16 · Jun 23, 2024 · Updated 2 years ago