The power-law compressed phase-aware asymmetric (PLCPA-ASYM) loss
☆14Sep 4, 2023Updated 2 years ago
Alternatives and similar repositories for PLCPA-ASYM-Loss
Users that are interested in PLCPA-ASYM-Loss are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 4 years ago
- ☆29Apr 17, 2023Updated 3 years ago
- Short-Time Discrete Cosine Transform (DCT) for Python. SciPy, TensorFlow and PyTorch implementations.☆28Feb 11, 2021Updated 5 years ago
- A Temporal-Spectral Generative Adversarial Network based End-to-end Packet Loss Concealment for Wideband Speech Transmission☆32Apr 27, 2022Updated 4 years ago
- ☆17Sep 12, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.☆91Feb 13, 2026Updated 3 months ago
- This repository contains the trained models and some audio samples for the tPLCnet.☆29Sep 26, 2023Updated 2 years ago
- ☆17Mar 30, 2023Updated 3 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- ☆14Oct 12, 2023Updated 2 years ago
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆15Sep 6, 2024Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆38Jan 17, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Jan 31, 2022Updated 4 years ago
- The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP☆19Apr 27, 2022Updated 4 years ago
- Pytorch implementation of DPCRN☆28Mar 31, 2024Updated 2 years ago
- ☆52Sep 10, 2024Updated last year
- ☆135Apr 24, 2023Updated 3 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- Differentiable dynamic range controller in PyTorch.☆52Feb 10, 2026Updated 4 months ago
- Gated Convolutional F-T-LSTM Neural Network☆39Jun 15, 2022Updated 3 years ago
- ☆55Mar 2, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆48Feb 17, 2026Updated 3 months ago
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆19Feb 25, 2025Updated last year
- ☆48Jul 7, 2025Updated 11 months ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆160Jul 16, 2022Updated 3 years ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆189Apr 15, 2025Updated last year
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆91Jun 8, 2025Updated last year
- ☆13Jan 12, 2024Updated 2 years ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆25Sep 27, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Papez: Resource-Efficient Speech Separation with Auditory Working Memory (ICASSP 2023)☆21Jun 25, 2023Updated 2 years ago
- ☆21Jul 15, 2024Updated last year
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆60Apr 14, 2025Updated last year
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- ☆43May 27, 2024Updated 2 years ago
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- ☆117Jan 8, 2021Updated 5 years ago