chorowski-lab/CPC_audio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chorowski-lab/CPC_audio)

chorowski-lab / CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

☆10

Alternatives and similar repositories for CPC_audio

Users that are interested in CPC_audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

chorowski-lab / hCPC
View on GitHub
Implementation of multi-level Contrastive Predictive Coding (CPC) methods
☆20Jan 12, 2023Updated 3 years ago
felixkreuk / UnsupSeg
View on GitHub
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆146Aug 5, 2022Updated 3 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
jasonppy / word-discovery
View on GitHub
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27Dec 4, 2023Updated 2 years ago
Fraunhofer-AISEC / towards-resistant-audio-adversarial-examples
View on GitHub
Generation tool for offset-resistant audio adversarial examples against Deepspeech
☆10Oct 5, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wiebket / bt4vt
View on GitHub
Bias Tests for Voice Technologies (bt4vt)
☆11Jun 16, 2024Updated 2 years ago
AI-secure / Characterizing-Audio-Adversarial-Examples-using-Temporal-Dependency
View on GitHub
ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".
☆11Apr 3, 2019Updated 7 years ago
shaojinding / GroupLatentEmbedding
View on GitHub
Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…
☆28Sep 17, 2019Updated 6 years ago
jonnybluesman / mscom
View on GitHub
Music structure analysis with community detection methods
☆18Oct 24, 2019Updated 6 years ago
FilippoMB / Total-variation-graph-neural-networks
View on GitHub
Pytorch (PyG) and Tensorflow (Keras/Spektral) implementation of Total Variation Graph Neural Network (TVGNN), as presented at ICML 2023.
☆20Mar 15, 2025Updated last year
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
mmmmayi / ExPO
View on GitHub
official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification
☆14Mar 14, 2025Updated last year
wnhsu / ResDAVEnet-VQ
View on GitHub
Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
☆28Feb 22, 2022Updated 4 years ago
danedane-haider / HybrA-Filterbanks
View on GitHub
A module for trainable encoder/decoder filterbanks with auditory bias.
☆17Feb 17, 2026Updated 5 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
speedyseal / audiosetdl
View on GitHub
Scripts for download AudioSet
☆89Nov 7, 2017Updated 8 years ago
AI-S2-Lab / GPT-Talker
View on GitHub
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆45Oct 28, 2024Updated last year
Rose-STL-Lab / symmetry-ode-discovery
View on GitHub
Official codebase for our NeurIPS paper, Symmetry-Informed Governing Equation Discovery.
☆11Nov 13, 2024Updated last year
facebookresearch / MMCSG
View on GitHub
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41Mar 13, 2024Updated 2 years ago
daniel-ho / SegNBDT
View on GitHub
Making high-accuracy and visually-interpretable decision tree-based models for semantic segmentation http://segnbdt.aaalv.in
☆11Oct 12, 2021Updated 4 years ago
spatialdatasciencegroup / HST
View on GitHub
[NeurIPS '23] Official code of "A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space"
☆14Jul 13, 2025Updated last year
ihp-lab / Avec2019_DDS
View on GitHub
Baseline scripts for AVEC 2019, Depression Detection Sub-challenge
☆16Jul 11, 2019Updated 7 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
kamperh / vqwordseg
View on GitHub
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆39May 5, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
shehzeen / waveguard_defense
View on GitHub
This is the codebase for defense framework described in USENIX '21 paper "WaveGuard: Understanding and Mitigating Audio Adversarial Examp…
☆20Oct 20, 2021Updated 4 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
HXS572 / Depression_Recognition
View on GitHub
Depression Recognition
☆12Mar 11, 2024Updated 2 years ago
facebookresearch / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374Oct 12, 2021Updated 4 years ago
danielnflam / MTANN
View on GitHub
MTANN for rib bone suppression
☆10May 25, 2021Updated 5 years ago
marl / jams-data
View on GitHub
Datasets and parsing scripts for JAMS
☆27Feb 1, 2020Updated 6 years ago
tychovdo / noethers-razor
View on GitHub
Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…
☆10Oct 12, 2024Updated last year
TrustAI / testRNN
View on GitHub
Coverage-Guided Testing of Long Short-Term Memory (LSTM) Networks
☆18Dec 15, 2020Updated 5 years ago
xiaoningdu / deepstellar
View on GitHub
☆15Aug 5, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zizhaozhang / distill2
View on GitHub
☆12Jun 21, 2022Updated 4 years ago
GeorgeEfstathiadis / LLM-Diarize-ASR-Agnostic
View on GitHub
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
☆23Jul 31, 2024Updated last year
justinsalamon / musicseg_deepemb
View on GitHub
Code for paper: "Deep Embeddings and Section Fusion Improve Music Segmentation"
☆54Oct 10, 2022Updated 3 years ago
mkazhdan / ECHODescriptors
View on GitHub
Extended Convolution Histogram of Orientations
☆14Dec 7, 2021Updated 4 years ago
nttcslab / dcase2023_task2_evaluator
View on GitHub
☆12Aug 10, 2023Updated 2 years ago
npuichigo / ttsflow
View on GitHub
tensorflow speech synthesis c++ inference for voicenet
☆16Mar 29, 2019Updated 7 years ago
retna319 / SMNN
View on GitHub
Scalable Monotonic Neural Networks
☆12Mar 14, 2024Updated 2 years ago