pigeonai-org / ViDoveLinks
[EMNLP 2025 Demo] 🐦ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning
☆105Updated 3 weeks ago
Alternatives and similar repositories for ViDove
Users that are interested in ViDove are comparing it to the libraries listed below
Sorting:
- 翻译姬:致力于小众领域的机器翻译☆535Updated last year
- async http process VST plugin☆160Updated 2 years ago
- 多个SVC/TTS的C++推理库☆1,100Updated 5 months ago
- Tacotron2 implementation of Japanese☆269Updated 3 years ago
- Genshin Datasets For SVC/SVS/TTS☆696Updated 2 months ago
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆991Updated 2 years ago
- Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。☆599Updated 2 months ago
- StarRail Datasets For SVC/SVS/TTS☆333Updated 2 months ago
- An automatic music transcription application☆79Updated 2 years ago
- AI for MAA☆203Updated last month
- Combining chatglm, vits, and pycqhttp for local deployment of qq chatbots.结合chatglm,vits,pycqhttp的本地部署qq聊天机器人。☆41Updated 2 years ago
- A general-purpose CV-based framework for extracting precise subtitle timelines from videos with embedded subtitles, from video to .ass fi…☆78Updated 2 months ago
- 你没体验过的船新自动打轴机2.0版☆409Updated 4 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆470Updated 2 years ago
- A convenient tool for generating audio files☆135Updated 2 years ago
- 纯良嘉心糖不会梦见电子彼女☆65Updated 2 years ago
- DiffSinger community vocoders release page☆291Updated 7 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆166Updated 2 years ago
- AI GAL是专门为Galgame场景设计的程序,旨在让得每一名用户都能享受到独一无二的剧情。程序基于renpy框架开发☆221Updated last month
- 记录mygo在b站下架前的样子☆300Updated 10 months ago
- Voice dataset of Genshin Impact 原神语音数据集☆713Updated 2 years ago
- ☆621Updated 2 years ago
- A voiceprint recognition classifier for audio dataset☆105Updated 2 years ago
- Implementation of the VITS model☆400Updated 2 years ago
- ☆97Updated 2 years ago
- GUI TTS Application based on Bert-VITS2☆29Updated last year
- An easy to understand TTS / SVS / SVC framework☆718Updated 2 weeks ago
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆162Updated 2 years ago
- ☆386Updated last year
- 爬取B站动态 获取Bilibili的单个用户的全部动态列表☆92Updated 2 years ago