ducnt18121997 / Viet-Text-NormalizationLinks
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehensive text normalization capabilities including handling of special characters, numbers, dates, and various text formats.
☆12Updated 3 months ago
Alternatives and similar repositories for Viet-Text-Normalization
Users that are interested in Viet-Text-Normalization are comparing it to the libraries listed below
Sorting:
- Bud500: A Comprehensive Vietnamese ASR Dataset☆66Updated last year
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆61Updated 6 months ago
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- ☆26Updated last year
- VietASR - Vietnamese Automatic Speech Recognition☆135Updated 8 months ago
- Use LoRA technique to improve training Large Language Model☆12Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year
- Transformation spoken text to written text☆30Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- ☆67Updated 2 months ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆161Updated 8 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated last year
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- ☆65Updated 11 months ago
- ☆107Updated last year
- ☆72Updated 2 years ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆13Updated 11 months ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆48Updated 2 months ago
- ☆67Updated last year
- top 1 Zalo AI challenge 2021 task hum to song☆109Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- ☆46Updated 2 years ago
- Repository for the paper "ViHateT5: Enhancing Hate Speech Detection in Vietnamese with A Unified Text-to-Text Transformer Model" (ACL'202…☆8Updated 11 months ago
- RAG for Vietnamese Wikipedia corpus.☆33Updated last year
- Build English-Vietnamese machine translation with ProtonX Transformer. :D☆70Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆99Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆93Updated last year