SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.
☆210Feb 5, 2024Updated 2 years ago
Alternatives and similar repositories for SubFix
Users that are interested in SubFix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Split audio using the .srt file, clean up annotations, then merge and package into a format suitable for bert-vits2 in a standard manner.…☆49Jun 17, 2024Updated last year
- vits2 backbone with bert☆337Apr 13, 2024Updated 2 years ago
- vits2 backbone with multilingual-bert☆8,720Apr 6, 2026Updated last week
- Bert-VITS2_V202本地一键推理☆20Nov 23, 2023Updated 2 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python script that slices audio with silence detection☆869Jun 8, 2024Updated last year
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Jan 17, 2024Updated 2 years ago
- a TTS demo for training new characters.☆471Jan 5, 2024Updated 2 years ago
- Genshin Datasets For SVC/SVS/TTS☆721Jan 11, 2026Updated 3 months ago
- A voiceprint recognition classifier for audio dataset☆105Jun 21, 2023Updated 2 years ago
- BertVITS2前端界面☆304Jan 1, 2024Updated 2 years ago
- A simple VITS HTTP API, developed by extending Moegoe with additional features.☆1,047Oct 5, 2025Updated 6 months ago
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,230Feb 5, 2024Updated 2 years ago
- Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注☆80Oct 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆5,020Jan 21, 2025Updated last year
- text to speech using autoregressive transformer and VITS☆248Apr 3, 2024Updated 2 years ago
- vits2 backbone with bert☆83Jan 8, 2024Updated 2 years ago
- Subtitle dubbing with multiple TTS Engines☆238Mar 22, 2026Updated 3 weeks ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆233Jun 24, 2025Updated 9 months ago
- VITS2 for Chinese speech | 最新VITS2中文语音合成☆135Oct 26, 2023Updated 2 years ago
- Colab adaptation of MVSep Model for MDX23 music separation contest☆330Sep 25, 2024Updated last year
- Unoffical implementation of Megatts2☆286Mar 23, 2024Updated 2 years ago
- Bark Voice Cloning and Voice Cloning for Chinese Speech☆2,967Dec 19, 2025Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆56,600Feb 9, 2026Updated 2 months ago
- This is my CUDA optimization of OpenCV seamlessClone API at NORMAL_CLONE mode.☆10Oct 29, 2023Updated 2 years ago
- 适用于 diffsinger 的多功能工具集☆11Apr 2, 2023Updated 3 years ago
- A simple GUI application that slices audio with silence detection☆1,441Apr 5, 2026Updated last week
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆36May 7, 2025Updated 11 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆183Jul 10, 2024Updated last year
- The reproduced code for Google's SoundStorm☆273Oct 7, 2023Updated 2 years ago
- 基于 g2pW 提升 pypinyin 的准确性☆104Jun 24, 2023Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆101Sep 23, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Documentation for Bert-VITS2☆22Nov 29, 2023Updated 2 years ago
- A simple GUI to show shot boundary detection based on TransNet V2.☆29Dec 5, 2020Updated 5 years ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆19Nov 23, 2024Updated last year
- ☆14Apr 2, 2023Updated 3 years ago
- Bert-vits2-V2.3 训练和推理☆50Mar 13, 2024Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)☆390Jun 21, 2025Updated 9 months ago