heraclex12 / vietpuncLinks

Vietnamese Punctuation Prediction using Pretrained Language Models

☆13

Alternatives and similar repositories for vietpunc

Users that are interested in vietpunc are comparing it to the libraries listed below

Sorting:

VinAIResearch / PhoST
A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)
☆22Updated last month
JSALT2022CodeSwitchingASR / generating-code-switched-audio
☆12Updated 5 months ago
nguyenvulebinh / spoken-norm
Transformation spoken text to written text
☆30Updated last year
ductuantruong / speaker_age_estimation_ssl_study
Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Updated 2 years ago
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27Updated last year
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14Updated last year
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14Updated 9 months ago
sholokhovalexey / online-speaker-clustering
☆17Updated 2 years ago
nguyenvulebinh / ViStreamASR
ViStreamASR - Real-Time Vietnamese Speech Recognition
☆27Updated 2 weeks ago
csalt-research / accented-codebooks-asr
☆18Updated 10 months ago
MarceloSancinetti / epa-gop-pykaldi
☆25Updated 3 years ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆26Updated last year
v-nhandt21 / Viphoneme
Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA
☆93Updated last year
yuhangear / wenet-android
☆12Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
HLTCHKUST / ASCEND
ASCEND Chinese-English code-switching dataset
☆24Updated 3 years ago
Open-Speech-EkStep / data-acquisition-pipeline
☆17Updated 4 years ago
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
liutaocode / DiarizationMetricInOne
Diarization Metric in One: current support DER, JER, CDER, SER, and BER
☆9Updated 2 years ago
khanld / chunkformer
ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription
☆48Updated 2 months ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
leduckhai / MultiMed-ST
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation
☆13Updated 3 months ago
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆29Updated 6 years ago
HKAB / whisper-finetune-vietnamese
Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM
☆38Updated last year
diego-fustes / asr-rescoring
Rescoring methods for end-to-end Automatic Speech Recognition
☆27Updated 4 years ago
pengzhendong / Torchaudio-Forced-Aligner
Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.
☆11Updated 6 months ago
iisys-hof / HUI-Audio-Corpus-German
This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…
☆31Updated 2 years ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆22Updated 10 months ago
edemattos / asr
Automatic Speech Recognition at the University of Edinburgh.
☆16Updated 4 years ago
Nathan-Roll1 / PSST
Prosodic Speech Segmentation with Transformers
☆25Updated last year