amphionspace/FlexiCodec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amphionspace/FlexiCodec)

amphionspace / FlexiCodec

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

☆40

Alternatives and similar repositories for FlexiCodec

Users that are interested in FlexiCodec are comparing it to the libraries listed below

Sorting:

Choddeok / DiEmo-TTS
View on GitHub
[INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…
☆16Sep 7, 2025Updated 5 months ago
errolyan / text_normalization_CH
View on GitHub
TTS前，文本标准化，将数字字母处理转化为汉字
☆12Apr 27, 2024Updated last year
P1ping / TokAN
View on GitHub
☆22Jul 30, 2025Updated 7 months ago
AlexIII / g729a-python
View on GitHub
G.729А audio codec for python 3
☆13Mar 18, 2020Updated 5 years ago
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆34Jun 7, 2025Updated 8 months ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 2 years ago
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆24Mar 19, 2025Updated 11 months ago
exercise-book-yq / Supercodec
View on GitHub
☆49Apr 1, 2025Updated 10 months ago
pengzhendong / streaming-tts-webui
View on GitHub
Streaming Text to Speech Web UI
☆22May 6, 2024Updated last year
the-bird-F / Expressive-Vectors
View on GitHub
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆36Dec 24, 2025Updated 2 months ago
urgent-challenge / urgent2026_challenge_track1
View on GitHub
Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.
☆32Nov 12, 2025Updated 3 months ago
lee-jhwn / fesde
View on GitHub
Toward Fully-End-to-End Listened Speech Decoding from EEG Signals (Interspeech 2024)
☆26Jan 3, 2025Updated last year
hi-paris / Prosody-Control-French-TTS
View on GitHub
An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control
☆30Jan 13, 2026Updated last month
Eps-Acoustic-Revolution-Lab / EAR_VAE
View on GitHub
This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...
☆56Feb 13, 2026Updated 2 weeks ago
IoSR-Surrey / IoSR_ListeningRoom_BRIRs
View on GitHub
The IoSR listening room multichannel BRIR dataset contains binaural room impulse responses measured at head angles of 0 to 360 degrees in…
☆22Mar 24, 2017Updated 8 years ago
ajaybati / miipher2.0
View on GitHub
Reimplementation of Miipher
☆29Aug 16, 2023Updated 2 years ago
lifeiteng / VoiceBox
View on GitHub
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆28Aug 4, 2023Updated 2 years ago
Shy-98 / MELLE
View on GitHub
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41Jun 28, 2025Updated 8 months ago
wsntxxn / UniFlow-Audio
View on GitHub
☆68Dec 30, 2025Updated 2 months ago
IDEA-Emdoor-Lab / UniTTS
View on GitHub
A TTS Trained on Universal Audio.
☆41Jun 6, 2025Updated 8 months ago
ASLP-lab / LLaSA_Plus
View on GitHub
Llasa Speed Up
☆60Jan 18, 2026Updated last month
tomer9080 / CarelessWhisper-Streaming
View on GitHub
Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
☆62Sep 18, 2025Updated 5 months ago
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆32Nov 19, 2024Updated last year
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆68Nov 1, 2024Updated last year
Anvarjon / Age-Gender-Classification
View on GitHub
Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…
☆27Mar 5, 2024Updated last year
wooseok-shin / MetricGAN-OKD
View on GitHub
Official PyTorch implementation of "Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement" (ICM…
☆30Mar 18, 2025Updated 11 months ago
tencent-ailab / MuCodec
View on GitHub
☆156Nov 22, 2024Updated last year
pengzhendong / wetext
View on GitHub
Python runtime for WeTextProcessing (does not depend on Pynini)
☆48Nov 28, 2025Updated 3 months ago
Mddct / transformer-vocos
View on GitHub
☆36Sep 6, 2025Updated 5 months ago
ZhikangNiu / A-DMA
View on GitHub
[INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆64Jun 16, 2025Updated 8 months ago
zeroone-universe / RealTimeBWE
View on GitHub
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
☆41Oct 20, 2025Updated 4 months ago
ReiherGroup / CoRe_optimizer
View on GitHub
Continual Resilient (CoRe) Optimizer for PyTorch
☆11Jun 10, 2024Updated last year
yukara-ikemiya / Open-Miipher-2
View on GitHub
PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind
☆64Sep 22, 2025Updated 5 months ago
jjunak-yun / FLowHigh_code
View on GitHub
[ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"
☆108Jan 17, 2025Updated last year
jonaskohler / stereoEEG2speech
View on GitHub
Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using th…
☆34Sep 2, 2022Updated 3 years ago
AI-S2-Lab / GPT-Talker
View on GitHub
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆44Oct 28, 2024Updated last year
archisman-panigrahi / QuickBib
View on GitHub
Get BibTeX from a DOI — fast
☆28Updated this week
ishine / Project_sp_ehance_matlab
View on GitHub
☆12Jun 17, 2019Updated 6 years ago
sp-uhh / sgmse-bbed
View on GitHub
Brownian Bridge with Exponential Diffusion Coefficient
☆44Nov 1, 2023Updated 2 years ago