Many0therFunctions/MaskGCT-Text-To-Semantic-Finetune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Many0therFunctions/MaskGCT-Text-To-Semantic-Finetune)

Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune

This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. This is just an inspirational proof of concept of what to do as a mere mortal when a neural network is open weights but not open source. Can ChadGPT reverse engineer an AI for a regular user? We shall see.

☆13

Alternatives and similar repositories for MaskGCT-Text-To-Semantic-Finetune

Users that are interested in MaskGCT-Text-To-Semantic-Finetune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆21Feb 18, 2024Updated 2 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year
postlang / posthog-llm
View on GitHub
PostHog with text analytics extensions, serving as an advanced LLM analytics platform.
☆15Sep 17, 2024Updated last year
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
reppy4620 / vocoders
View on GitHub
My vocoder experiments
☆31Jul 26, 2025Updated last year
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
KdaiP / conformer-RoPE
View on GitHub
Conformer block with Rotary Position Embedding, modified from lucidrains' implement
☆19Sep 13, 2024Updated last year
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lifeiteng / SoundStorm
View on GitHub
☆71Jul 13, 2023Updated 3 years ago
DanielLin94144 / StyleTalk
View on GitHub
Official release of StyleTalk dataset.
☆75Jul 1, 2024Updated 2 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
AMAAI-Lab / DART
View on GitHub
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16Apr 22, 2026Updated 3 months ago
Rongjiehuang / awesome-speech-to-speech-translation
View on GitHub
List of direct speech-to-speech translation papers.
☆39Jan 31, 2023Updated 3 years ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
hongwen-sun / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆15Dec 19, 2018Updated 7 years ago
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 9 months ago
TylorShine / MNP-SVC
View on GitHub
Real-time end-to-end singing voice convertion
☆25Nov 3, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70Nov 1, 2024Updated last year
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆23Oct 10, 2025Updated 9 months ago
CSTR-Edinburgh / qualtreats
View on GitHub
Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.
☆36Jun 25, 2024Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
lukaszliniewicz / breath-removal
View on GitHub
Detect and remove or lower the volume of breathing in speech recordings.
☆17May 14, 2025Updated last year
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
changelinglab / PhoneticXeus
View on GitHub
A universal phone recognizer that can transcribe speech in 70+ languages into IPA
☆29Jun 9, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ndkgit339 / spe-dss
View on GitHub
Speech Parameter Estimation Using Differentiable Speech Synthesizer
☆43May 9, 2023Updated 3 years ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
quinte22 / bumblebee
View on GitHub
bumble bee transformer
☆14Apr 19, 2021Updated 5 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
View on GitHub
This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21Sep 18, 2023Updated 2 years ago
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year
Naozumi520 / g2pW-Cantonese
View on GitHub
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆15Dec 10, 2024Updated last year