pigeonai-org / ViDoveLinks
🐦ViDove: RAG-Augmented End-to-end Multimodal Translation Agent
☆99Updated this week
Alternatives and similar repositories for ViDove
Users that are interested in ViDove are comparing it to the libraries listed below
Sorting:
- 翻译姬:致力于小众领域的机器翻译☆538Updated 11 months ago
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆987Updated 2 years ago
- Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。☆581Updated 3 weeks ago
- Based on Talking-head-anime 3, works like Vtube Studio.☆2,472Updated this week
- A convenient tool for generating audio files☆135Updated 2 years ago
- ☆615Updated 2 years ago
- Tacotron2 implementation of Japanese☆269Updated 2 years ago
- StarRail Datasets For SVC/SVS/TTS☆327Updated 3 weeks ago
- An automatic music transcription application☆78Updated last year
- Genshin Datasets For SVC/SVS/TTS☆689Updated 3 weeks ago
- ☆388Updated last year
- DiffSinger community vocoders release page☆289Updated 5 months ago
- 多个SVC/TTS的C++推理库☆1,095Updated 3 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆471Updated 2 years ago
- A voiceprint recognition classifier for audio dataset☆103Updated 2 years ago
- AI for MAA☆192Updated 2 weeks ago
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162Updated 2 years ago
- async http process VST plugin☆160Updated 2 years ago
- 记录mygo在b站下架前的样子☆299Updated 8 months ago
- GUI for MoeGoe☆570Updated last year
- 从Galgame中提取人物语音和对应文本用于制作SVC/TTS项目的数据集。Extract character voice and corresponding text from Galgame to create a dataset for SVC/TTS project…☆42Updated 11 months ago
- Implementation of the VITS model☆401Updated 2 years ago
- An unofficial implementation of the combination of Soft-VC and VITS☆461Updated 2 years ago
- ☆33Updated last year
- 你没体验过的船新自动打轴机2.0版☆406Updated 4 years ago
- An auxiliary tool for manual screening of audio dataset.☆129Updated 2 years ago
- GUI TTS Application based on Bert-VITS2☆29Updated last year
- Combined ChatGPT with Moegoe TTS to create a Chatting Waifu☆832Updated last year
- ACG Text-to-Speech☆175Updated 2 years ago
- SOME: Singing-Oriented MIDI Extractor.☆585Updated 7 months ago