FerdinandZhong / punctuatorLinks
A small seq2seq punctuator tool based on DistilBERT
☆53Updated 8 months ago
Alternatives and similar repositories for punctuator
Users that are interested in punctuator are comparing it to the libraries listed below
Sorting:
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆116Updated 4 months ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆220Updated last year
- Neural end-to-end Speech Translation Toolkit☆309Updated 3 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- A module for normalising text.☆173Updated 3 years ago
- wav2vec2 asr with transformers☆16Updated 3 years ago
- Mirror of SRILM☆57Updated 5 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆282Updated 7 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆79Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 2 months ago
- ☆76Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 6 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆118Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆132Updated last year
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆81Updated 4 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- ☆14Updated 4 years ago
- ☆12Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆117Updated 3 months ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆215Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆56Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆391Updated 4 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆150Updated last year
- Awesome TTS☆60Updated 4 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆48Updated 4 years ago