VietASR - Vietnamese Automatic Speech Recognition
☆166 · Mar 26, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for viet-asr
Users who are interested in viet-asr are comparing it to the libraries listed below.
- Whisper fine-tuned on VinBigdata-VLSP2020-100h + KenLM ☆38 · Oct 6, 2023 · Updated 2 years ago
- Speech to text with self-supervised learning based on the wav2vec 2.0 framework ☆379 · Nov 22, 2021 · Updated 4 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0 ☆105 · Sep 3, 2021 · Updated 4 years ago
- Fine-tuning Vietnamese Text-to-speech model (VITS) ☆58 · Mar 18, 2025 · Updated last year
- Vietnamese Text to Speech library ☆257 · Aug 20, 2023 · Updated 2 years ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024) ☆211 · Nov 12, 2024 · Updated last year
- Python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn ☆15 · Aug 13, 2024 · Updated last year
- VietTTS: An Open-Source Vietnamese Text to Speech ☆84 · Dec 23, 2025 · Updated 3 months ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl… ☆67 · Jan 1, 2025 · Updated last year
- Vietnamese Automatic Speech Recognition ☆71 · Jan 6, 2019 · Updated 7 years ago
- Anyone can build their own chatbot with instruction tuning, using a 12 GB GPU (RTX 3060) and a few dozen MB of data ☆114 · Jun 10, 2023 · Updated 2 years ago
- Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for t… ☆373 · Sep 5, 2022 · Updated 3 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆22 · Jun 5, 2025 · Updated 10 months ago
- A Vietnamese Voice Cloning Text-to-Speech Model ✨ ☆514 · Apr 4, 2025 · Updated last year
- ☆148 · Apr 23, 2025 · Updated 11 months ago
- This repo builds an end-to-end deep learning application that supports a speech recognition system. It is simple to use and understand ☆38 · May 23, 2023 · Updated 2 years ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness ☆156 · Aug 27, 2023 · Updated 2 years ago
- ☆110 · Oct 9, 2023 · Updated 2 years ago
- This is the official repository for the Vista dataset: a Vietnamese multimodal dataset containing more than 700,000 samples of conversations a… ☆26 · May 14, 2024 · Updated last year
- A synthesized dataset for the Vietnamese TTS task ☆66 · May 6, 2022 · Updated 3 years ago
- A toolbox for Vietnamese Optical Character Recognition. ☆135 · Oct 11, 2022 · Updated 3 years ago
- Transforming spoken text to written text ☆31 · May 14, 2024 · Updated last year
- Solution for MC_OCR competition ☆95 · Mar 7, 2023 · Updated 3 years ago
- The project includes: 1. Building a Vietnamese instructions dataset (high-quality, large, and diverse). 2. LLM Training, Finetuning, Evaluating & Testin… ☆280 · Sep 1, 2025 · Updated 7 months ago
- Transformer OCR ☆763 · Jan 19, 2025 · Updated last year
- Docker image for ResourceSpace ☆33 · Oct 10, 2025 · Updated 6 months ago
- ☆23 · Oct 15, 2018 · Updated 7 years ago
- A Robustly Optimized BERT Pretraining Approach for Vietnamese ☆32 · Jul 25, 2024 · Updated last year
- ☆22 · Feb 9, 2018 · Updated 8 years ago
- Use the LoRA technique to improve the training of large language models ☆13 · Jul 25, 2023 · Updated 2 years ago
- ☆67 · Apr 12, 2024 · Updated 2 years ago
- Using the open-source LLM Llama2 by Meta for local CPU inference on document question answering ☆15 · Oct 5, 2023 · Updated 2 years ago
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings) ☆776 · Jul 23, 2024 · Updated last year
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment ☆68 · Dec 5, 2022 · Updated 3 years ago
- Finetune Wav2vec 2.0 For Speech Recognition ☆151 · Feb 6, 2025 · Updated last year
- Bud500: A Comprehensive Vietnamese ASR Dataset ☆69 · Oct 10, 2025 · Updated 6 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆23 · Jul 1, 2024 · Updated last year
- eKYC (Electronic Know Your Customer) is a project designed to electronically verify the identity of customers ☆54 · Dec 7, 2024 · Updated last year
- Vietnamese speech recognition using Wavenet ☆73 · Feb 2, 2023 · Updated 3 years ago