ducnt18121997 / Viet-Text-NormalizationLinks
A Python library for text normalization, specifically designed for Vietnamese and English text processing. This library provides comprehensive text normalization capabilities including handling of special characters, numbers, dates, and various text formats.
☆13Updated 9 months ago
Alternatives and similar repositories for Viet-Text-Normalization
Users that are interested in Viet-Text-Normalization are comparing it to the libraries listed below
Sorting:
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Updated last year
- ☆133Updated 8 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆69Updated 3 months ago
- VietASR - Vietnamese Automatic Speech Recognition☆160Updated last year
- Transformation spoken text to written text☆31Updated last year
- Vietnamese Voice Cloning System using Speaker Verification training on multispeaker VITS☆57Updated 2 years ago
- A modified VITS that utilizes phoneme duration's ground truth for better robustness☆151Updated 2 years ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆75Updated last month
- ViStreamASR - Real-Time Vietnamese Speech Recognition☆52Updated 6 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆68Updated last year
- PhoWhisper: Automatic Speech Recognition for Vietnamese (2024)☆190Updated last year
- Use LoRA technique to improve training Large Language Model☆13Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆101Updated last year
- Vietnamese self-supervised Wav2vec2 model☆61Updated 3 years ago
- A synthesized dataset for Vietnamese TTS task☆66Updated 3 years ago
- python script to download & process data to train a speech-synthesis model of Vietnamese M.C. Nguyễn Ngọc Ngạn☆14Updated last year
- ☆74Updated 2 years ago
- RAG for Vietnamese Wikipedia corpus.☆35Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆104Updated 4 years ago
- Ai cũng có thể tự tạo chatbot bằng huấn luyện chỉ dẫn, với 12G GPU (RTX 3060) và khoảng vài chục MB dữ liệu☆114Updated 2 years ago
- wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech☆95Updated 6 months ago
- ☆47Updated 2 years ago
- ☆11Updated 2 years ago
- ☆26Updated last year
- Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single…☆136Updated last year
- A Vietnamese-English Neural Machine Translation System (INTERSPEECH 2022)☆136Updated last year
- ☆67Updated last year