thomas-xin / Encodec-Stream
A lightweight wrapper around https://github.com/facebookresearch/encodec that enables dynamic streamed reading, seeking, metadata and GPU support.
☆15 Updated last year
Alternatives and similar repositories for Encodec-Stream
Users who are interested in Encodec-Stream are comparing it to the libraries listed below.
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆133 Updated 2 months ago
- Code for the blog "Neural audio codecs: how to get audio into LLMs" ☆138 Updated last month
- An unofficial PyTorch implementation of VALL-E ☆88 Updated 4 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆104 Updated last year
- ☆61 Updated 2 years ago
- Audiogen Codec ☆143 Updated last year
- Google's SoundStorm: Efficient Parallel Audio Generation ☆131 Updated 2 years ago
- ☆93 Updated last year
- ☆56 Updated 2 years ago
- PyTorch implementation of SoundCTM ☆100 Updated 8 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆44 Updated 2 months ago
- A lightweight voice conversion ☆85 Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS ☆51 Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆81 Updated 2 years ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs ☆76 Updated 2 weeks ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆91 Updated 11 months ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆32 Updated 2 years ago
- ☆43 Updated last year
- Unofficial implementation of the WaveNeXt vocoder ☆53 Updated last year
- A neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kbps or 4.5 kbps. ☆194 Updated 5 months ago
- GPT-style network for phonemization with durations of text ☆68 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆71 Updated 3 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆99 Updated last year
- ☆45 Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆109 Updated last year
- Collection of scripts from mHuBERT-147. ☆32 Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025 ☆36 Updated 9 months ago
- Official Implementation of EnCLAP (ICASSP 2024) ☆94 Updated last year
- Official repository of Wavehax vocoder ☆62 Updated last week
- A toolkit to calculate speech audio quality. Not affiliated with the original authors. ☆65 Updated last year