taras-sereda / deep-learning-for-audioView external linksLinks
β17Jun 6, 2024Updated last year
Alternatives and similar repositories for deep-learning-for-audio
Users that are interested in deep-learning-for-audio are comparing it to the libraries listed below
Sorting:
- Dictionary of word stresses in the Ukrainian language πΊπ¦β22Sep 29, 2024Updated last year
- Dictionary of obscene words for Ukrainian languageβ22May 15, 2025Updated 8 months ago
- UNLP 2025 Shared Task on Detecting Social Media Manipulationβ23Aug 4, 2025Updated 6 months ago
- UCU Audio Processing Courseβ39Updated this week
- Speech Emotion Recognition using Deep Learningβ12May 24, 2021Updated 4 years ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesisβ40Sep 14, 2023Updated 2 years ago
- β15Apr 4, 2023Updated 2 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithmsβ14Feb 2, 2026Updated last week
- Dataset of Ukrainian handwritted letters with plenty of variationsβ12Apr 15, 2021Updated 4 years ago
- An experimental custom seq-2-seq model with both layer-wise (inter-layer), and intra-layer attention (attention to previous hidden statesβ¦β10Nov 30, 2017Updated 8 years ago
- acnn for text-independent speaker recognitionβ10Feb 8, 2022Updated 4 years ago
- A docker container skeleton for Flask micro-servicesβ11Mar 25, 2021Updated 4 years ago
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)β10Nov 2, 2021Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representationβ12Jan 27, 2023Updated 3 years ago
- A list of papers about concept bottleneck models (CBMs)β18Nov 12, 2025Updated 3 months ago
- PyTorch re-implementation of some papers on image captioning | εΎεζθΏ°β14Apr 22, 2021Updated 4 years ago
- Tools to isolate speaker and transcribe unstructured audio clipsβ11Dec 4, 2022Updated 3 years ago
- β15Oct 29, 2024Updated last year
- β10Apr 8, 2024Updated last year
- Official implementation of BPA (CVPR 2022)β13Jun 17, 2022Updated 3 years ago
- Practice for Machine Learning in Production courseβ13Jun 7, 2025Updated 8 months ago
- β12Oct 21, 2019Updated 6 years ago
- code and speech demo for speech reconstruction from ECoG recordingsβ12May 21, 2025Updated 8 months ago
- PyTorch implementation of A Neural Algorithm of Artistic Styleβ10Dec 20, 2019Updated 6 years ago
- Altered TCR Ligand Affinities and Structuresβ12Dec 1, 2023Updated 2 years ago
- Multimodal deep learning in neuroimagingβ14Jan 27, 2023Updated 3 years ago
- β14Mar 29, 2022Updated 3 years ago
- β11Jul 1, 2022Updated 3 years ago
- Open Source Crimean Tatar Text-to-Speech datasetsβ14Feb 23, 2025Updated 11 months ago
- β12Aug 12, 2021Updated 4 years ago
- Real-time melgan based on cpu οΌοΌοΌβ13Dec 3, 2019Updated 6 years ago
- Unsupervised feature learning for audio classification using convolutional deep belief networksβ12Jul 25, 2015Updated 10 years ago
- Detect when your pets want to go outsideβ11Apr 1, 2021Updated 4 years ago
- β13Jul 10, 2021Updated 4 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that useβ¦β10Jan 25, 2021Updated 5 years ago
- Learning associations between human faces and voicesβ12Feb 15, 2019Updated 6 years ago
- Example source for MongoDB / JavaScript snippetsβ27Mar 11, 2013Updated 12 years ago
- Fast model deployment on AWS EC2β14Feb 25, 2024Updated last year
- visual-text to speechβ14Apr 3, 2022Updated 3 years ago