☆13Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for Bert-VITS2-2
Users that are interested in Bert-VITS2-2 are comparing it to the libraries listed below.
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- Documentation for Bert-VITS2☆22Nov 29, 2023Updated 2 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 4 years ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- ☆20May 28, 2025Updated 10 months ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Code for the EMNLP paper "Improving Detection and Categorization of Task-relevant Utterances through Integration of Discourse Structure a…☆12Nov 23, 2022Updated 3 years ago
- ☆12Mar 20, 2020Updated 6 years ago
- optimized wav2lip☆18Jan 6, 2024Updated 2 years ago
- ☆39Sep 5, 2023Updated 2 years ago
- Since the original Wav2Lip was trained on LRS2 and did not perform well on Chinese, I preprocessed the CMLR dataset and would t…☆63Sep 23, 2023Updated 2 years ago
- https://subversion.assembla.com/svn/buddy-profiles.honorbuddy/trunk/☆11Jan 17, 2015Updated 11 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆22May 26, 2025Updated 10 months ago
- A comprehensive framework for benchmarking single and multi-agent systems across a wide range of tasks—evaluating performance, accuracy, …☆37Nov 11, 2025Updated 5 months ago
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆17Aug 31, 2023Updated 2 years ago
- Separately maintained Chinese TTS☆34Oct 28, 2022Updated 3 years ago
- Code and data for the paper "Dual Dynamic Memory Network for End-to-End Multi-turn Task-oriented Dialog Systems".☆14Aug 16, 2022Updated 3 years ago
- ☆25Feb 11, 2023Updated 3 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- Simple Conversational Data Augmentation for Semi-supervised Abstractive Conversation Summarization☆10Mar 7, 2022Updated 4 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- wav2lip-api☆11Mar 16, 2023Updated 3 years ago
- An attempt to replicate the architecture of MiniMaxTTS described in its technical report☆47Sep 2, 2025Updated 7 months ago
- This is the proposal network for MultiPerson Pose Estimation.☆14Oct 21, 2017Updated 8 years ago
- Comprehensive preprocessing tool for wav2lip training data☆40Nov 18, 2023Updated 2 years ago
- ☆10Feb 17, 2023Updated 3 years ago
- text to speech using autoregressive transformer and VITS☆248Apr 3, 2024Updated 2 years ago
- Network library implemented with C++23 standard☆10Mar 28, 2026Updated 2 weeks ago
- experiments about AudioSet☆43Jul 22, 2023Updated 2 years ago
- ☆23Apr 10, 2025Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆21Jul 1, 2024Updated last year
- C++11 base library built on folly, wangle, and proxygen☆11Apr 29, 2018Updated 7 years ago
- Artificial Intelligence and Deep Learning in Practice - TensorFlow Edition (MD & Notebooks)☆13Nov 8, 2025Updated 5 months ago
- ☆14Sep 10, 2025Updated 7 months ago