XZWY/SpatialCodec

Implementation of SpatialCodec.

☆71

Alternatives and similar repositories for SpatialCodec

Users who are interested in SpatialCodec are comparing it to the libraries listed below.

Andong-Li-speech / G2Net
The implementation of G2Net, an extension of GaGNet, which is in submission to T-ASLP
☆19 · Apr 27, 2022 · Updated 4 years ago
XZWY / MSLDM
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆29 · Sep 12, 2024 · Updated last year
lucacoma / NeuralBeamspaceDomainFilter
Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19 · Oct 21, 2022 · Updated 3 years ago
yluo42 / SRVQ
Spherical residual vector quantization (SRVQ)
☆31 · Aug 25, 2024 · Updated last year
wangtianrui / APC-SNR
Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch
☆28 · Jan 31, 2022 · Updated 4 years ago
Andong-Li-speech / TaylorBeamformer
The implementation of TaylorBeamformer, which is in submission to Interspeech2022
☆49 · Jun 10, 2022 · Updated 4 years ago
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆54 · Mar 17, 2025 · Updated last year
FYJNEVERFOLLOWS / LaBNet
Official PyTorch implementation of the Interspeech 2023 paper
☆29 · Jul 5, 2023 · Updated 3 years ago
felixperfler / Stable-Hybrid-Auditory-Filterbanks
[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
☆43 · Jul 25, 2025 · Updated 11 months ago
vkothapally / JAECBF
☆62 · Apr 11, 2022 · Updated 4 years ago
vkothapally / Subband-Beamformer
☆34 · Nov 29, 2022 · Updated 3 years ago
Aworselife / DPTBF
☆17 · Sep 12, 2023 · Updated 2 years ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92 · Dec 20, 2024 · Updated last year
facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆221 · Jun 25, 2024 · Updated 2 years ago
Andong-Li-speech / TaEr
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14 · Nov 25, 2022 · Updated 3 years ago
ChengGuoliang0 / SBSS-NAEC-CTF
☆16 · Oct 31, 2022 · Updated 3 years ago
bfs18 / rfwave
☆151 · Apr 25, 2025 · Updated last year
Audio-WestlakeU / SAR-SSL
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40 · Oct 11, 2024 · Updated last year
Andong-Li-speech / MDNet
The implementation of MDNet, which is in submission to Interspeech2022
☆14 · May 1, 2022 · Updated 4 years ago
wenet-e2e / wesignal
Production first, nn-based on-device signal processing toolkit.
☆63 · May 30, 2023 · Updated 3 years ago
YangXusheng-yxs / CodecFormer_5Hz
☆35 · Oct 23, 2025 · Updated 8 months ago
Andong-Li-speech / EaBNet
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…
☆107 · Jun 10, 2022 · Updated 4 years ago
ArrayDPS / ArrayDPS
☆40 · May 12, 2025 · Updated last year
Emrys365 / se-scaling
Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…
☆41 · Aug 7, 2024 · Updated last year
Qinwen-Hu / SDCM
☆25 · Feb 28, 2023 · Updated 3 years ago
yuguochencuc / BAE-Net
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
☆80 · Aug 20, 2024 · Updated last year
fgnt / graph_pit
☆42 · Oct 14, 2022 · Updated 3 years ago
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
☆52 · Sep 10, 2024 · Updated last year
hyyan2k / PGUSE
This is the official implementation of PGUSE
☆41 · Jun 7, 2025 · Updated last year
haoheliu / SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
☆254 · Mar 7, 2025 · Updated last year
kyegomez / AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…
☆39 · Jan 27, 2025 · Updated last year
adelacvg / diff-vits
☆39 · Oct 1, 2023 · Updated 2 years ago
RoyChao19477 / PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
☆73 · May 11, 2024 · Updated 2 years ago
ASLP-lab / SenSE
Official code of SenSE.
☆90 · Oct 30, 2025 · Updated 8 months ago
zhai-lw / SQCodec
A lightweight audio codec based on a single quantizer
☆72 · Aug 15, 2025 · Updated 11 months ago
tencent-ailab / MuCodec
☆168 · Nov 22, 2024 · Updated last year
Andong-Li-speech / RNDVoC
This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which is submitted to TPAMI.
☆54 · Oct 11, 2025 · Updated 9 months ago
Aria-K-Alethia / BigCodec
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆218 · Sep 19, 2024 · Updated last year
Le-Xiaohuai-speech / SKIP-DPCRN
☆52 · Jun 14, 2022 · Updated 4 years ago