A speech signal processing library in Python with emphasis on deep learning.
☆31 · Jul 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for pyaudlib
Users that are interested in pyaudlib are comparing it to the libraries listed below.
- ☆10 · Sep 25, 2024 · Updated last year
- ☆19 · Mar 10, 2023 · Updated 3 years ago
- Script to generate VAD dataset used in Asteroid recipe ☆21 · Sep 30, 2021 · Updated 4 years ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE) ☆28 · Sep 16, 2023 · Updated 2 years ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021… ☆21 · Jul 4, 2021 · Updated 4 years ago
- Generalized Minimal Distortion Principle for Blind Source Separation ☆22 · Sep 16, 2020 · Updated 5 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024 ☆13 · Apr 15, 2025 · Updated 11 months ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- Experimenting with Lapped Transforms Jupyter Notebook ☆14 · Jun 13, 2025 · Updated 9 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition ☆19 · Jul 16, 2024 · Updated last year
- ☆15 · Jul 11, 2022 · Updated 3 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L… ☆56 · Jul 6, 2023 · Updated 2 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE ☆13 · Mar 31, 2021 · Updated 4 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations ☆48 · Jul 25, 2024 · Updated last year
- ☆15 · Jul 4, 2024 · Updated last year
- Digital Audio Effects in Python (material for MUSI6202@Georgiatech) ☆15 · Nov 30, 2014 · Updated 11 years ago
- ☆18 · May 15, 2021 · Updated 4 years ago
- ☆54 · Jun 3, 2020 · Updated 5 years ago
- Official PyTorch code for Deep Audio-Signal Holistic Embeddings ☆187 · Nov 7, 2025 · Updated 4 months ago
- Collection of models and extensions for deployment in PyTorch ☆24 · Nov 20, 2022 · Updated 3 years ago
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an … ☆72 · Feb 26, 2026 · Updated 3 weeks ago
- ☆17 · Jan 26, 2021 · Updated 5 years ago
- ☆14 · Jun 13, 2020 · Updated 5 years ago
- Text-based media editing interface ☆16 · Aug 9, 2017 · Updated 8 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources. ☆35 · Mar 22, 2021 · Updated 5 years ago
- ☆20 · Nov 22, 2020 · Updated 5 years ago
- An extension for rendering sandboxed iframes inside of Panels ☆18 · Feb 28, 2018 · Updated 8 years ago
- Comprehensive Python library for speech and voice. ☆32 · Dec 8, 2022 · Updated 3 years ago
- YoloV6 for a bare Raspberry Pi using ncnn. ☆11 · Jun 12, 2024 · Updated last year
- Official PyTorch implementation of the Interspeech 2023 paper ☆28 · Jul 5, 2023 · Updated 2 years ago
- Code for the paper "Self-Supervised Learning for Anomalous Sound Detection" ☆40 · May 13, 2024 · Updated last year
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels. ☆22 · Apr 8, 2021 · Updated 4 years ago
- ☆19 · Apr 1, 2020 · Updated 5 years ago
- FastAPI Server Implementation for Bilibili Index TTS ☆25 · Apr 13, 2025 · Updated 11 months ago
- Addressing the confounds of accompaniments in singer identification ☆18 · Mar 24, 2020 · Updated 6 years ago
- ☆22 · Nov 25, 2025 · Updated 4 months ago
- ☆29 · Mar 6, 2026 · Updated 2 weeks ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- ☆18 · Feb 9, 2020 · Updated 6 years ago