ducnt18121997 / Viet-Text-NormalizationLinks
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehensive text normalization capabilities including handling of special characters, numbers, dates, and various text formats.
☆11Updated 2 months ago
Alternatives and similar repositories for Viet-Text-Normalization
Users that are interested in Viet-Text-Normalization are comparing it to the libraries listed below
Sorting:
- Bud500: A Comprehensive Vietnamese ASR Dataset☆66Updated last year
- Transformation spoken text to written text☆30Updated last year
- Use LoRA technique to improve training Large Language Model☆12Updated last year
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆60Updated 5 months ago
- ☆26Updated last year
- Vietnamese self-supervised Wav2vec2 model☆62Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆98Updated 3 years ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆41Updated 3 weeks ago
- ☆9Updated last month
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 11 months ago
- VietASR - Vietnamese Automatic Speech Recognition☆130Updated 7 months ago
- Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn☆26Updated 2 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆113Updated last year
- ☆70Updated 2 years ago
- top 1 Zalo AI challenge 2021 task hum to song☆109Updated 3 years ago
- ☆62Updated 9 months ago
- ☆69Updated last year
- ☆33Updated last year
- ☆50Updated 2 years ago
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆54Updated last year
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆87Updated 11 months ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆11Updated 5 months ago
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆154Updated 6 months ago