FerdinandZhong / punctuatorLinks
A small seq2seq punctuator tool based on DistilBERT
☆50Updated 6 months ago
Alternatives and similar repositories for punctuator
Users that are interested in punctuator are comparing it to the libraries listed below
Sorting:
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- The case study and multilingfual performance of ICASSP submission☆24Updated 2 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆151Updated 4 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆45Updated 2 years ago
- ☆59Updated last year
- Mirror of SRILM☆56Updated 4 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆115Updated last month
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆117Updated last month
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆215Updated 10 months ago
- ☆103Updated 3 years ago
- Punctuation restoration in ASR text☆33Updated 5 years ago
- Open source library for few shot NLP☆78Updated 2 years ago
- ☆52Updated 4 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆46Updated last week
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆62Updated 3 years ago
- asr2k☆50Updated last year
- cLang-8 is a dataset for grammatical error correction.☆106Updated 2 years ago
- ☆76Updated 3 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 4 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆115Updated 9 months ago
- ☆57Updated 2 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Updated last year
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆52Updated 2 years ago
- ICU based universal language tokenizer☆32Updated 3 years ago
- A module for normalising text.☆174Updated 3 years ago