pigeonai-org / ViDoveLinks
🐦ViDove: RAG-Augmented End-to-end Multimodal Translation Agent
☆97Updated this week
Alternatives and similar repositories for ViDove
Users that are interested in ViDove are comparing it to the libraries listed below
Sorting:
- Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。☆563Updated 2 months ago
- 翻译姬:致力于小众领域的机器翻译☆538Updated 10 months ago
- ☆613Updated 2 years ago
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆991Updated 2 years ago
- 多个SVC/TTS的C++推理库☆1,088Updated last month
- ☆389Updated last year
- A general-purpose CV-based framework for extracting precise subtitle timelines from videos with embedded subtitles, from video to .ass fi…☆70Updated last month
- Tacotron2 implementation of Japanese☆269Updated 2 years ago
- AI GAL是专门为Galgame场景设计的程序,旨在让得每一名用户都能享受到独一无二的剧情。程序基于renpy框架开发☆212Updated last week
- async http process VST plugin☆161Updated 2 years ago
- Implementation of the VITS model☆402Updated last year
- StarRail Datasets For SVC/SVS/TTS☆324Updated last month
- A voiceprint recognition classifier for audio dataset☆101Updated 2 years ago
- 你没体验过的船新自动打轴机2.0版☆407Updated 4 years ago
- Combining chatglm, vits, and pycqhttp for local deployment of qq chatbots.结合chatglm,vits,pycqhttp的本地部署qq聊天机器人。☆41Updated 2 years ago
- Asset Viewer for Uma Musume☆85Updated 2 years ago
- 纯良嘉心糖不会梦见电子彼女☆65Updated 2 years ago
- Genshin Datasets For SVC/SVS/TTS☆684Updated last month
- GUI for MoeGoe☆571Updated last year
- A convenient tool for generating audio files☆135Updated 2 years ago
- Forked from innnky so-vits-svc☆151Updated 2 years ago
- Combined ChatGPT with Moegoe TTS to create a Chatting Waifu☆829Updated last year
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆161Updated 2 years ago
- 记录mygo在b站下架前的样子☆298Updated 7 months ago
- ACG Text-to-Speech☆175Updated 2 years ago
- Storyteller(说书人) 自动发送弹幕插件,独特的说书人功能、屏蔽颜色弹幕功能以及单句/多句自动定时循环发射功能、编程功能、计数器功能、临时弹幕功能、快速发射功能、定时启动功能,本项目基于GPLv2。☆243Updated last year
- Voice dataset of Genshin Impact 原神语音数据集☆710Updated 2 years ago
- AI for MAA☆187Updated last year
- An auxiliary tool for manual screening of audio dataset.☆126Updated 2 years ago
- Extract the voice and corresponding text☆84Updated 5 months ago