SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.
☆209Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for SubFix
Users that are interested in SubFix are comparing it to the libraries listed below
Sorting:
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆49Jun 17, 2024Updated last year
- vits2 backbone with bert☆337Apr 13, 2024Updated last year
- vits2 backbone with multilingual-bert☆8,692Feb 23, 2026Updated last week
- Python script that slices audio with silence detection☆869Jun 8, 2024Updated last year
- A voiceprint recognition classifier for audio dataset☆105Jun 21, 2023Updated 2 years ago
- a TTS demo for training new characters.☆470Jan 5, 2024Updated 2 years ago
- Genshin Datasets For SVC/SVS/TTS☆717Jan 11, 2026Updated last month
- Bert-VITS2_V202本地一键推理☆20Nov 23, 2023Updated 2 years ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆1,044Oct 5, 2025Updated 4 months ago
- BertVITS2前端界面☆303Jan 1, 2024Updated 2 years ago
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,227Feb 5, 2024Updated 2 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆80Oct 10, 2023Updated 2 years ago
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Jan 17, 2024Updated 2 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35May 7, 2025Updated 9 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆5,019Jan 21, 2025Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Colab adaptation of MVSep Model for MDX23 music separation contest☆328Sep 25, 2024Updated last year
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 3 months ago
- text to speech using autoregressive transformer and VITS☆249Apr 3, 2024Updated last year
- Subtitle dubbing with multiple TTS Engines☆229Oct 31, 2025Updated 4 months ago
- Unoffical implementation of Megatts2☆288Mar 23, 2024Updated last year
- VITS2 for Chinese speech | 最新VITS2中文语音合成☆136Oct 26, 2023Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆100Sep 23, 2022Updated 3 years ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,967Dec 19, 2025Updated 2 months ago
- 基于 g2pW 提升 pypinyin 的准确性☆104Jun 24, 2023Updated 2 years ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆227Jun 24, 2025Updated 8 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- unofficial vits2-TTS implementation in pytorch☆547Mar 28, 2024Updated last year
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- ☆16Jun 12, 2025Updated 8 months ago
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆380Jun 21, 2025Updated 8 months ago
- A simple GUI application that slices audio with silence detection☆1,439Jul 29, 2024Updated last year
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆55,429Feb 9, 2026Updated 3 weeks ago
- vits2 backbone with bert☆83Jan 8, 2024Updated 2 years ago
- The reproduced code for Google's SoundStorm☆270Oct 7, 2023Updated 2 years ago
- A VapourSynth filter that displays the FFT frequency spectrum of a given clip.☆12Dec 12, 2021Updated 4 years ago