A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆22Jun 5, 2025Updated 9 months ago
Alternatives and similar repositories for PhoST
Users that are interested in PhoST are comparing it to the libraries listed below
Sorting:
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆48Jun 3, 2025Updated 9 months ago
- ☆19Jun 28, 2022Updated 3 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆66Jan 1, 2025Updated last year
- ☆13Nov 22, 2022Updated 3 years ago
- VnDT: A Vietnamese Dependency Treebank☆24Nov 6, 2021Updated 4 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- Zalo Text-To-Speech for python☆11May 10, 2021Updated 4 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆53Aug 8, 2023Updated 2 years ago
- ☆16Apr 24, 2025Updated 10 months ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Jul 22, 2024Updated last year
- VietASR - Vietnamese Automatic Speech Recognition☆164Oct 29, 2024Updated last year
- ViStreamASR - Real-Time Vietnamese Speech Recognition☆52Jul 12, 2025Updated 7 months ago
- ☆16Nov 18, 2020Updated 5 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Jun 24, 2022Updated 3 years ago
- Russian accentuator and IPA transcriber☆16Sep 10, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Jul 22, 2024Updated last year
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Dec 31, 2024Updated last year
- 一个第三方的泠鸢yousa歌声数据集☆17Nov 28, 2023Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated last year
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)☆72Jul 22, 2024Updated last year
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Official repo for the Vietnam-Celeb dataset☆26Aug 27, 2023Updated 2 years ago
- We provide benchmark datasets for evaluating Vietnamese processing models: UIT-ViQuAD, ViNewsQA, UIT-VSFC, UIT-ViIC, UIT-ViNames, UIT-VSM…☆20Jun 19, 2021Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 4 years ago
- Thư viện chuẩn hóa văn bản Tiếng Việt☆180May 26, 2025Updated 9 months ago
- RusTTS is an unofficial Coqui TTS implementation.☆21Aug 12, 2022Updated 3 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago