☆17Jun 6, 2024Updated last year
Alternatives and similar repositories for deep-learning-for-audio
Users that are interested in deep-learning-for-audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dictionary of obscene words for Ukrainian language☆23May 15, 2025Updated 11 months ago
- UNLP 2025 Shared Task on Detecting Social Media Manipulation☆23Aug 4, 2025Updated 9 months ago
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆22Sep 29, 2024Updated last year
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated last year
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆40Sep 14, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Oct 29, 2024Updated last year
- num_workers Search Algorithm for Fast PyTorch DataLoader☆23Jul 29, 2021Updated 4 years ago
- UCU Audio Processing Course☆42Apr 27, 2026Updated last week
- Agent toolkit for 100 hours of speech and 10 GiB of text☆14Jul 15, 2025Updated 9 months ago
- OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline☆41Apr 10, 2026Updated 3 weeks ago
- Ukrainian TTS (text-to-speech) using ESPNET☆240Mar 8, 2025Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Tools to isolate speaker and transcribe unstructured audio clips☆11Dec 4, 2022Updated 3 years ago
- ☆15May 31, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of "Face detection in untrained deep neural networks" (Baek et al., Nature Communications, 2021)☆10Nov 2, 2021Updated 4 years ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 3 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- ☆32Aug 4, 2021Updated 4 years ago
- [DEPRECATED] Adds a Profiler tab to gather statistics about Doctrine queries made during a request☆58Jun 5, 2013Updated 12 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- Speech Emotion Recognition using Deep Learning☆13May 24, 2021Updated 4 years ago
- ☆12Oct 21, 2019Updated 6 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- visual-text to speech☆14Apr 3, 2022Updated 4 years ago
- An experimental custom seq-2-seq model with both layer-wise (inter-layer), and intra-layer attention (attention to previous hidden states…☆10Nov 30, 2017Updated 8 years ago
- Unsupervised feature learning for audio classification using convolutional deep belief networks☆12Jul 25, 2015Updated 10 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- Korean Parallel Corpus☆11Nov 27, 2014Updated 11 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- A curated list of full-duplex spoken dialogue models & benchmarks☆61Apr 30, 2026Updated last week
- Twin Neural Network Training with PyTorch and fast.ai and its Deployment with TorchServe on Amazon SageMaker☆11May 21, 2024Updated last year
- ☆17Mar 24, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆15Apr 4, 2023Updated 3 years ago
- ☆14Mar 18, 2023Updated 3 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Apr 29, 2020Updated 6 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Ukranian NER annotation project☆93Apr 23, 2025Updated last year