TTS Dia finetuning for Vietnamese
☆125Dec 3, 2025Updated 3 months ago
Alternatives and similar repositories for Dia-Finetuning-Vietnamese
Users that are interested in Dia-Finetuning-Vietnamese are comparing it to the libraries listed below
Sorting:
- EraX Text to Speech base on F5-TTS Base V1☆80May 8, 2025Updated 10 months ago
- ☆140Apr 23, 2025Updated 10 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- SDVN custom node for comfyUI☆93Feb 10, 2026Updated last month
- ☆11Jan 1, 2024Updated 2 years ago
- A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehe…☆13Mar 30, 2025Updated 11 months ago
- Dự án công cụ chuyển đổi giọng nói dành cho người Việt☆25Updated this week
- A tutorial on how to train RNN-T from scratch with Whisper encoder☆12Mar 11, 2025Updated 11 months ago
- ☆12Oct 6, 2024Updated last year
- The task aims at extracting required fields in receipts captured by mobile devices☆34Nov 4, 2022Updated 3 years ago
- ☆57Feb 1, 2026Updated last month
- A Vietnamese Voice Cloning Text-to-Speech Model ✨☆509Apr 4, 2025Updated 11 months ago
- Scene text vietnamese☆19May 18, 2022Updated 3 years ago
- ☆23May 21, 2025Updated 9 months ago
- ☆57May 30, 2025Updated 9 months ago
- This is a project about Optical Character Recognition (OCR) in Vietnamese texts by using PaddleOCR and VietOCR.☆27Mar 19, 2024Updated last year
- ☆19Jun 17, 2024Updated last year
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆104Jun 21, 2024Updated last year
- RAG Best Practice on Vietnamese☆258Oct 3, 2025Updated 5 months ago
- ☆47Nov 7, 2023Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24May 1, 2024Updated last year
- ☆11Apr 25, 2025Updated 10 months ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degra…☆39May 28, 2025Updated 9 months ago
- Vietnamese license plate recognition☆88Jul 13, 2022Updated 3 years ago
- ☆69Aug 15, 2024Updated last year
- In this repo, I use encoder, decoder with attention mechanism to auto-correct output of vietnamese ocr model☆30Oct 12, 2021Updated 4 years ago
- AI-powered tool that transforms STEM concepts into narrated educational animations using Manim, LLMs, and multimodal AI☆73Oct 4, 2025Updated 5 months ago
- Comprehensive tools for building (Retrieval Augmented Generation) RAG chatbots.☆83Feb 6, 2025Updated last year
- FireWork HTML☆10Dec 27, 2024Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆47Apr 14, 2025Updated 10 months ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Jul 22, 2024Updated last year
- Latest hcap images (udpdated from time to time)☆28Feb 25, 2023Updated 3 years ago
- Use Github Workflow to schedule to track Tiki, Shopee price changes☆10Updated this week
- A sample Chatbot in C# using Microsoft Agent Framework☆85Feb 23, 2026Updated 2 weeks ago
- Unofficial Meta Messenger Chat API for NodeJS☆18Updated this week
- 🥑 Intellij plugin to optimization Vector Drawable 🥑☆11Apr 7, 2019Updated 6 years ago
- ☆12May 24, 2025Updated 9 months ago
- A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extra…☆12Oct 5, 2025Updated 5 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago