FerdinandZhong / punctuatorLinks
A small seq2seq punctuator tool based on DistilBERT
☆52Updated 7 months ago
Alternatives and similar repositories for punctuator
Users that are interested in punctuator are comparing it to the libraries listed below
Sorting:
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆116Updated 3 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆219Updated last year
- Neural end-to-end Speech Translation Toolkit☆307Updated 3 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆282Updated 6 months ago
- A module for normalising text.☆173Updated 3 years ago
- ☆103Updated 4 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆41Updated 2 years ago
- Mirror of SRILM☆57Updated 5 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆45Updated 2 years ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆53Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated last month
- Support tools for punctuation and boundary detection for ASR output.☆57Updated 2 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆117Updated 2 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆47Updated last month
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆207Updated 2 years ago
- ☆76Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 6 years ago
- Awesome TTS☆59Updated 3 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆116Updated 2 months ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆13Updated 2 years ago
- Python implementation of an N-gram language model with Laplace smoothing and sentence generation.☆86Updated 7 years ago
- CoVoST: A Large-Scale Multilingual Speech-To-Text Translation Corpus (CC0 Licensed)☆386Updated 3 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆152Updated 4 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆116Updated 10 months ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆81Updated 4 years ago