ducnt18121997 / Viet-Text-Normalization
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehensive text normalization capabilities including handling of special characters, numbers, dates, and various text formats.
☆11Updated last month
Alternatives and similar repositories for Viet-Text-Normalization:
Users that are interested in Viet-Text-Normalization are comparing it to the libraries listed below
- Bud500: A Comprehensive Vietnamese ASR Dataset☆66Updated last year
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆59Updated 4 months ago
- Transformation spoken text to written text☆30Updated 11 months ago
- Use LoRA technique to improve training Large Language Model☆12Updated last year
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆13Updated 8 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 10 months ago
- Vietnamese self-supervised Wav2vec2 model☆61Updated 2 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 11 months ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆83Updated 10 months ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆98Updated 3 years ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- ☆68Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- ☆26Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆52Updated last year
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆21Updated 9 months ago
- VietASR - Vietnamese Automatic Speech Recognition☆127Updated 6 months ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆146Updated 5 months ago
- ☆46Updated last year
- ☆9Updated last week
- ☆50Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated last year
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆28Updated last week
- This repo aims to build a web app that supports speech recognition system It's simple to use and understand☆38Updated last year
- A synthesized dataset for Vietnamese TTS task☆63Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- ☆62Updated 8 months ago