VinAIResearch/PhoST

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)

☆25

Alternatives and similar repositories for PhoST

Users who are interested in PhoST are comparing it to the repositories listed below.

datquocnguyen / VnDT
VnDT: A Vietnamese Dependency Treebank
☆24 · Nov 6, 2021 · Updated 4 years ago

datquocnguyen / PhoW2V
Pre-trained Word2Vec syllable- and word-level embeddings for Vietnamese
☆54 · Aug 8, 2023 · Updated 2 years ago

VinAIResearch / ViText2SQL
ViText2SQL: A dataset for Vietnamese Text-to-SQL semantic parsing (EMNLP-2020 Findings)
☆39 · Jul 22, 2024 · Updated 2 years ago

VinAIResearch / PhoNLP
PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)
☆148 · Dec 31, 2024 · Updated last year

VinAIResearch / PhoNER_COVID19
COVID-19 Named Entity Recognition for Vietnamese (NAACL 2021)
☆73 · Jul 22, 2024 · Updated 2 years ago

thelinhbkhn2014 / VnCoreNLP_Wrapper
☆25 · Aug 28, 2024 · Updated last year

vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago

VinAIResearch / BARTpho
BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)
☆105 · Jul 22, 2024 · Updated 2 years ago

nguyenvulebinh / lyric-alignment
Vietnamese song lyric alignment framework
☆68 · Dec 11, 2022 · Updated 3 years ago

CLC-HCMUS / ViMs-Dataset
☆16 · Nov 18, 2020 · Updated 5 years ago

tuvuumass / task-transferability
Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.
☆48 · Mar 8, 2021 · Updated 5 years ago

cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15 · Dec 3, 2024 · Updated last year

michaelneri / unsupervised-audio-anomaly-detection
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11 · Nov 6, 2024 · Updated last year

haoheliu / ontology-aware-audio-tagging
☆14 · Nov 22, 2022 · Updated 3 years ago

tuanio / conformer-rnnt
Conformer RNN-Transducer
☆14 · May 25, 2022 · Updated 4 years ago

Koziev / StressModel
Neural model for prediction of stress position in Russian words
☆13 · Jun 22, 2025 · Updated last year

TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago

janhq / WhisperSpeech
Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…
☆16 · Jan 20, 2025 · Updated last year

VinAIResearch / RecGPT
RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)
☆42 · Sep 22, 2024 · Updated last year

kamilakesbi / DiarizersLM
☆15 · Jul 16, 2024 · Updated 2 years ago

hitz-zentroa / whisper-lm
Add n-gram and large language model (LLM) support to Whisper models.
☆43 · May 6, 2025 · Updated last year

huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 · May 17, 2024 · Updated 2 years ago

nhungnt7 / PLANTA
Planning for Success: Exploring LLM Long-term Planning Capabilities in Table Understanding
☆17 · Jun 17, 2025 · Updated last year

mailong25 / bert-vietnamese-question-answering
Vietnamese question answering system with BERT
☆117 · Jan 12, 2023 · Updated 3 years ago

ductuantruong / speaker_age_estimation_ssl_study
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14 · Oct 19, 2022 · Updated 3 years ago

manhph2211 / ViTTS
In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system. In general, I used Portaspeech as an…
☆12 · Nov 24, 2023 · Updated 2 years ago

zenanz / ChemPatentEmbeddings
☆21 · Aug 18, 2020 · Updated 5 years ago

seungheondoh / hi_kia
Wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago

langmaninternet / VietnameseTextNormalizer
Vietnamese text normalization library
☆180 · May 26, 2025 · Updated last year

Kyoto-University-Speech-and-Audio / feng-asr-ser
☆10 · Sep 6, 2020 · Updated 5 years ago

thelinhbkhn2014 / Text2PhonemeSequence
☆53 · Aug 28, 2024 · Updated last year

sangHa0411 / Llama-Instruction-Tuning
☆10 · Dec 28, 2023 · Updated 2 years ago

ahmedshah1494 / speech_robust_bench
☆18 · Apr 24, 2025 · Updated last year

nguyenvulebinh / visen
ViSen is a library to format the tone of Vietnamese sentences
☆22 · Nov 9, 2021 · Updated 4 years ago

yuhanghe01 / RiTTA
Event Relation in Text-to-Audio (TTA) Generation
☆21 · Feb 26, 2025 · Updated last year

VinAIResearch / XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
☆354 · Jul 22, 2024 · Updated 2 years ago

AsoSoft / AsoSoft-TTS-Speech-Corpus-for-Central-Kurdish
AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech
☆23 · Jun 24, 2022 · Updated 4 years ago

mozilla / murmur
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20 · May 23, 2019 · Updated 7 years ago

baochi0212 / LaVy
Pioneering in Vietnamese Multimodal Large Language Model
☆53 · Jan 23, 2025 · Updated last year