An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10Feb 22, 2022Updated 4 years ago
Alternatives and similar repositories for CPC_audio
Users that are interested in CPC_audio are comparing it to the libraries listed below
Sorting:
- Implementation of multi-level Contrastive Predictive Coding (CPC) methods☆20Jan 12, 2023Updated 3 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated last year
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Sep 17, 2019Updated 6 years ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year
- Scripts for download AudioSet☆86Nov 7, 2017Updated 8 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated last year
- ☆56Feb 17, 2026Updated 2 weeks ago
- N. Papadakis, G. Peyré, E. Oudet. Optimal Transport with Proximal Splitting. SIAM Journal on Imaging Sciences, 7(1), pp. 212–238, 2014.☆11Jan 7, 2017Updated 9 years ago
- Object-Oriented Programming II☆12Jul 23, 2021Updated 4 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆11Dec 19, 2025Updated 2 months ago
- react-d3 brush implementation☆12Jul 26, 2024Updated last year
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- A full-stack boilerplate leveraging Nuxt 2.x and Feathers 3.x.☆11Nov 25, 2018Updated 7 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- ☆13Oct 25, 2024Updated last year
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- Crossplatform dock icon implementation☆28Mar 6, 2014Updated 11 years ago
- SongDriver uses a parallel mechanism of prediction and arrangement phases to achieve zero logical latency in real-time accompaniment gene…☆14Jan 5, 2026Updated last month
- SongDriver2 achieves a balance between real-time emotion fit and soft transitions, enhancing the coherence of the generated music.☆11Nov 15, 2025Updated 3 months ago
- Towards an implementation of hierarchical temporal memory and the cortical learning algorithm by Jeff Hawkins and Dileep George of Nument…☆12Mar 15, 2017Updated 8 years ago
- QQ自动登录、点赞、留言、定时发送消息、消息轰炸☆11Jan 29, 2021Updated 5 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- React 农历日历☆10May 5, 2015Updated 10 years ago
- Embed computation and tldraw canvas in your markdown☆14Jan 30, 2024Updated 2 years ago
- Exploring animations with in Meteor using the Blaze _uihooks☆10Jan 19, 2015Updated 11 years ago
- Clojure library for Blueprints (part of the Tinkerpop graph stack).☆38Sep 6, 2022Updated 3 years ago
- ☆10Feb 19, 2021Updated 5 years ago
- KaTeX + coloring + interactivity to make equations explained well (prototype)☆19Dec 31, 2025Updated 2 months ago
- The original Vue Webpack, with Veux, Pug, CoffeeScript and Stylus added☆12May 29, 2024Updated last year
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- SyzgyDB: An embeddable vector database in Go for efficient disk-based storage and similarity searches, supporting various distance metric…☆10Nov 1, 2024Updated last year