An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for CPC_audio
Users that are interested in CPC_audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Jan 12, 2023Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- Pytorch (PyG) and Tensorflow (Keras/Spektral) implementation of Total Variation Graph Neural Network (TVGNN), as presented at ICML 2023.☆20Mar 15, 2025Updated last year
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".☆12Apr 3, 2019Updated 6 years ago
- Music structure analysis with community detection methods☆18Oct 24, 2019Updated 6 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Sep 17, 2019Updated 6 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 3 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- QQ自动登录、点赞、留言、定时发送消息、消息轰炸☆12Jan 29, 2021Updated 5 years ago
- Scripts for download AudioSet☆87Nov 7, 2017Updated 8 years ago
- Official codebase for our NeurIPS paper, Symmetry-Informed Governing Equation Discovery.☆11Nov 13, 2024Updated last year
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year
- [NeurIPS '23] Official code of "A Hierarchical Spatial Transformer for Massive Point Samples in Continuous Space"☆13Jul 13, 2025Updated 8 months ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated 2 years ago
- Baseline scripts for AVEC 2019, Depression Detection Sub-challenge☆16Jul 11, 2019Updated 6 years ago
- Making high-accuracy and visually-interpretable decision tree-based models for semantic segmentation http://segnbdt.aaalv.in☆11Oct 12, 2021Updated 4 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Code for NeurIPS 2024 paper: "Noether's razor: Learning Conserved Quantities" by Tycho F. A. van der Ouderaa, Mark van der Wilk, Pim de H…☆10Oct 12, 2024Updated last year
- Depression Recognition☆12Mar 11, 2024Updated 2 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆369Oct 12, 2021Updated 4 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated 2 years ago
- Coverage-Guided Testing of Long Short-Term Memory (LSTM) Networks☆18Dec 15, 2020Updated 5 years ago
- MTANN for rib bone suppression☆10May 25, 2021Updated 4 years ago
- ☆15Aug 5, 2020Updated 5 years ago
- Extended Convolution Histogram of Orientations☆14Dec 7, 2021Updated 4 years ago
- Code for paper: "Deep Embeddings and Section Fusion Improve Music Segmentation"☆53Oct 10, 2022Updated 3 years ago
- This is the codebase for defense framework described in USENIX '21 paper "WaveGuard: Understanding and Mitigating Audio Adversarial Examp…☆21Oct 20, 2021Updated 4 years ago
- Datasets and parsing scripts for JAMS☆27Feb 1, 2020Updated 6 years ago
- Repository for "LLM-based speaker diarization correction: A generalizable approach" paper☆21Jul 31, 2024Updated last year
- Scalable Monotonic Neural Networks☆12Mar 14, 2024Updated 2 years ago
- ☆12Jun 21, 2022Updated 3 years ago
- PyTorch implementation of Deep Spectral Clustering paper on a toy dataset☆25Jun 15, 2018Updated 7 years ago
- An Easy and Unified Interface for Robots (and Grippers, etc.)☆13Nov 7, 2024Updated last year