A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆22Jun 5, 2025Updated 9 months ago
Alternatives and similar repositories for PhoST
Users that are interested in PhoST are comparing it to the libraries listed below.
- VnDT: A Vietnamese Dependency Treebank☆24Nov 6, 2021Updated 4 years ago
- ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)☆36Jul 22, 2024Updated last year
- Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese☆53Aug 8, 2023Updated 2 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆149Dec 31, 2024Updated last year
- COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)☆72Jul 22, 2024Updated last year
- ☆25Aug 28, 2024Updated last year
- ☆19Jun 28, 2022Updated 3 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆104Jul 22, 2024Updated last year
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- ☆16Nov 18, 2020Updated 5 years ago
- Zalo Text-To-Speech for Python☆11May 10, 2021Updated 4 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆48Mar 8, 2021Updated 5 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆67Jan 1, 2025Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆11Nov 6, 2024Updated last year
- ☆14Nov 22, 2022Updated 3 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 9 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 9 months ago
- Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)☆17Sep 19, 2021Updated 4 years ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆41Sep 22, 2024Updated last year
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- Vietnamese question answering system with BERT☆117Jan 12, 2023Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- VietASR - Vietnamese Automatic Speech Recognition☆165Updated this week
- Vietnamese text normalization library☆181May 26, 2025Updated 9 months ago
- ☆22Aug 18, 2020Updated 5 years ago
- A third-party singing voice dataset of 泠鸢yousa☆17Nov 28, 2023Updated 2 years ago
- ☆11Sep 6, 2020Updated 5 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated last year
- ☆53Aug 28, 2024Updated last year
- The WikiHow-based and DeScript-based datasets for the event prediction task (IJCNLP 2017)☆15Mar 1, 2019Updated 7 years ago
- From the paper "Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition"☆27Nov 20, 2024Updated last year
- ☆16Apr 24, 2025Updated 11 months ago
- ☆10Dec 28, 2023Updated 2 years ago
- XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)☆348Jul 22, 2024Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆20Jun 24, 2022Updated 3 years ago