pigeonai-org / ViDoveLinks
[EMNLP 2025 Demo] 🐦ViDove: A Translation Agent System with Multimodal Context and Memory-Augmented Reasoning
☆110Updated last month
Alternatives and similar repositories for ViDove
Users that are interested in ViDove are comparing it to the libraries listed below
Sorting:
- 翻译姬:致力于小众领域的机器翻译☆539Updated last year
- Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc☆994Updated 2 years ago
- Tacotron2 implementation of Japanese☆269Updated 3 years ago
- Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。☆629Updated 6 months ago
- GUI for MoeGoe☆572Updated 2 years ago
- 多个SVC/TTS的C++推理库☆1,119Updated 8 months ago
- An automatic music transcription application☆79Updated 2 years ago
- ☆624Updated 3 years ago
- Genshin Datasets For SVC/SVS/TTS☆712Updated 3 weeks ago
- An unofficial implementation of the combination of Soft-VC and VITS☆456Updated 3 years ago
- A general-purpose CV-based framework for extracting precise subtitle timelines from videos with embedded subtitles, from video to .ass fi…☆91Updated 5 months ago
- DiffSinger community vocoders release page☆299Updated 11 months ago
- A convenient tool for generating audio files☆134Updated 3 years ago
- StarRail Datasets For SVC/SVS/TTS☆336Updated 6 months ago
- ☆33Updated 2 years ago
- Forked from innnky so-vits-svc☆152Updated 2 years ago
- Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)☆161Updated 2 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆940Updated 2 years ago
- ACG Text-to-Speech☆175Updated 3 years ago
- A voiceprint recognition classifier for audio dataset☆106Updated 2 years ago
- Executable file for VITS inference☆2,410Updated 2 years ago
- ☆387Updated 2 years ago
- async http process VST plugin☆160Updated 2 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆469Updated 3 years ago
- GUI TTS Application based on Bert-VITS2☆30Updated 2 years ago
- 无需情感标注的情感可控语音合成模型,基于VITS☆1,395Updated 2 years ago
- An auxiliary tool for manual screening of audio dataset.☆133Updated 2 years ago
- An application specialized in image super-resolution for ACGN illustrations and Visual Novel CG. 专注于插画/Galgame CG等ACGN领域的图像超分辨率的应用☆370Updated last month
- SOME: Singing-Oriented MIDI Extractor.☆653Updated 2 weeks ago
- 樱之刻简中汉化☆383Updated last year