40740 / Bert-VITS2-2View external linksLinks
☆13Mar 7, 2024Updated last year
Alternatives and similar repositories for Bert-VITS2-2
Users that are interested in Bert-VITS2-2 are comparing it to the libraries listed below
Sorting:
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 3 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆17Aug 31, 2023Updated 2 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆25Feb 11, 2023Updated 3 years ago
- Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would t…☆63Sep 23, 2023Updated 2 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- 单独维护的中文TTS☆34Oct 28, 2022Updated 3 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- PyTorch implmenetation of YOLO v3, including training and testing, and can be adapted for user-defined dataset☆31Feb 21, 2019Updated 6 years ago
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- ☆35Mar 14, 2023Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- ☆38Sep 5, 2023Updated 2 years ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern☆11May 30, 2024Updated last year
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- Application for Math formula detection in image/pdf and then recognition☆12Jan 14, 2025Updated last year
- ☆11Aug 20, 2025Updated 5 months ago
- Network library implemented with C++23 standard☆10Jan 10, 2026Updated last month
- Term Project at GTCMT exploring phase based features for Singing Voice Detection with Neural Networks☆11Apr 20, 2018Updated 7 years ago
- All in one scanner Made on flutter Framework☆12Jun 2, 2021Updated 4 years ago
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 3 months ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated 3 weeks ago
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 3 months ago
- An SSH plugin for Dify☆12Jan 16, 2026Updated 3 weeks ago
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- wav2lip训练数据预处理综合工具☆40Nov 18, 2023Updated 2 years ago
- Just a suturing monster project.☆42Nov 21, 2023Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated last year
- Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring☆24Dec 15, 2025Updated 2 months ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Official repository of Tapir Lab.'s Lip-Sync Method☆10Oct 3, 2023Updated 2 years ago
- ☆10Feb 17, 2023Updated 2 years ago
- DB-based Optical Chemical Structure Recognition☆12Sep 12, 2022Updated 3 years ago