v3ucn / Bert-vits2-V2.3
View external linksLinks

Bert-vits2-V2.3 训练和推理

☆50

Alternatives and similar repositories for Bert-vits2-V2.3

Users that are interested in Bert-vits2-V2.3 are comparing it to the libraries listed below

Sorting:

codebyzeb / g2p-plus
View on GitHub
Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories
☆19Apr 10, 2025Updated 10 months ago
HauLiang / DAMAS-FISTA-Net
View on GitHub
Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming
☆15Aug 20, 2024Updated last year
v3ucn / Bert-VITS2-Extra_-
View on GitHub
Bert-VITS2-Extra_中文特化版本训练和推理
☆26Feb 10, 2024Updated 2 years ago
MuyangDu / T5Voice
View on GitHub
T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …
☆28Nov 7, 2025Updated 3 months ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated last year
jadepeng / bertTokenizer
View on GitHub
java implementation of Bert Tokenizer, support output onnx tensor for onnx model inference
☆12Sep 4, 2023Updated 2 years ago
huahuahuage / Bert-VITS2-Speech
View on GitHub
Bert-VITS2 onnx推理版本
☆44Apr 24, 2024Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated 10 months ago
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆20Feb 10, 2025Updated last year
crystal0913 / merlin-tts
View on GitHub
c++ code for merlin tts
☆22Oct 19, 2019Updated 6 years ago
ZarahShibli / Arabic_Punctuation_Prediction
View on GitHub
Sequence to sequence model for Arabic punctuation prediction.
☆12Feb 13, 2020Updated 6 years ago
Harry-Yu-Shuhang / Step-Audio-tts
View on GitHub
☆11Feb 20, 2025Updated 11 months ago
zhaohb / MeloTTS-OV
View on GitHub
Using OpenVINO to speed up MeloTTS inference
☆15Nov 1, 2024Updated last year
season-studio / MeloTTS-ONNX
View on GitHub
An implementation of MeloTTS by onnxruntime
☆29Oct 27, 2024Updated last year
v3ucn / Fix-Loudness
View on GitHub
音频响度统一，音量归一化处理
☆12May 3, 2024Updated last year
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
View on GitHub
C++ version of pyannote audio overlapped speech detection pipeline
☆13Feb 14, 2024Updated 2 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 3 years ago
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
Tele-AI / TELEVAL
View on GitHub
☆22Jan 29, 2026Updated 2 weeks ago
fakerybakery / OpenF5-TTS
View on GitHub
(WIP) A retrain of F5-TTS on permissively-licensed data
☆13Apr 6, 2025Updated 10 months ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
v3ucn / ASR_TOOLS_SenseVoice_WebUI
View on GitHub
Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型
☆184Jul 10, 2024Updated last year
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Jun 5, 2023Updated 2 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
facebookresearch / llama-hd-dataset
View on GitHub
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆22Jan 22, 2024Updated 2 years ago
fengpeng-yue / ASRTTS
View on GitHub
ASR & TTS joint training, asr, tts, machine speech chain
☆16Oct 16, 2021Updated 4 years ago
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆22Oct 10, 2025Updated 4 months ago
Picovoice / tts-latency-benchmark
View on GitHub
Text-to-Speech Latency Benchmark
☆22Jan 16, 2026Updated last month
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
Zeqiang-Lai / Prosody_Prediction
View on GitHub
Predict prosody labels for Chinese sentences.
☆41Jul 7, 2022Updated 3 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 5 years ago
jisang93 / VISinger
View on GitHub
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆19May 12, 2023Updated 2 years ago
pengzhendong / streaming-tts-webui
View on GitHub
Streaming Text to Speech Web UI
☆22May 6, 2024Updated last year
sigmeta / g2p-kd
View on GitHub
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion
☆20Jul 9, 2019Updated 6 years ago
Minjun-KANG / Wav2Lip_Windows_GUI
View on GitHub
Wav2Lip model Windows GUI Program using PyQT5
☆19Jun 4, 2021Updated 4 years ago
jiangyuxiaoxiao / Bert-VITS2-UI
View on GitHub
BertVITS2前端界面
☆303Jan 1, 2024Updated 2 years ago
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 2 years ago

v3ucn / Bert-vits2-V2.3View external linksLinks

Alternatives and similar repositories for Bert-vits2-V2.3

v3ucn / Bert-vits2-V2.3
View external linksLinks