PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)
☆212Nov 12, 2024Updated last year
Alternatives and similar repositories for PhoWhisper
Users that are interested in PhoWhisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- VietASR - Vietnamese Automatic Speech Recognition☆166Apr 21, 2026Updated 2 weeks ago
- ☆25Aug 28, 2024Updated last year
- Vietnamese Text to Speech library☆259Aug 20, 2023Updated 2 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆23Jun 5, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Oct 6, 2023Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆109Jun 21, 2024Updated last year
- Detecting Omissions in Geographic Maps through Computer Vision (MAPR'24)☆24Jul 31, 2024Updated last year
- ☆151Apr 23, 2025Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- Dịch máy giữa ngôn ngữ anh-viet☆63Jun 21, 2020Updated 5 years ago
- Sentiment classification for Vietnamese text using PhoBert☆98Nov 16, 2020Updated 5 years ago
- Corpus tiếng việt☆383Oct 3, 2025Updated 7 months ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness☆157Aug 27, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆20Aug 15, 2025Updated 8 months ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Jul 22, 2024Updated last year
- ☆79May 4, 2024Updated 2 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)☆780Jul 23, 2024Updated last year
- ☆13Oct 27, 2025Updated 6 months ago
- A dataset for Vietnamese Spelling Correction☆15Sep 27, 2021Updated 4 years ago
- Solution for MC_OCR competition☆95Mar 7, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆349Jul 22, 2024Updated last year
- Dự án bao gồm: 1. Xây dựng bộ dữ Instructions Vietnamese (chất lượng, nhiều, và đa dạng). 2.LLM Training, Finetuning, Evaluating & Testin…☆281Sep 1, 2025Updated 8 months ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆11Oct 25, 2023Updated 2 years ago
- This is our project for the Mobile Development course at HCMUS.☆12Jan 13, 2023Updated 3 years ago
- MISCA: A Joint Model for Multiple Intent Detection and Slot Filling with Intent-Slot Co-Attention (EMNLP 2023 - Findings)☆33Jul 22, 2024Updated last year
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆41Sep 22, 2024Updated last year
- Transformer OCR☆767Jan 19, 2025Updated last year
- Thư viện chuẩn hóa văn bản Tiếng Việt☆181May 26, 2025Updated 11 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆51May 22, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26May 14, 2024Updated last year
- Final Project for OOP Course - University of Science, VNUHCM☆10Feb 13, 2023Updated 3 years ago
- ☆15Sep 8, 2018Updated 7 years ago
- Python Vietnamese Core NLP Toolkit☆274Sep 26, 2024Updated last year
- A Robustly Optimized BERT Pretraining Approach for Vietnamese☆32Jul 25, 2024Updated last year
- English-Vietnamese Machine Translation using Transformer (Pytorch)☆12Jun 30, 2023Updated 2 years ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆50Jun 3, 2025Updated 11 months ago