FerdinandZhong / punctuatorLinks
A small seq2seq punctuator tool based on DistilBERT
☆53Updated 11 months ago
Alternatives and similar repositories for punctuator
Users that are interested in punctuator are comparing it to the libraries listed below
Sorting:
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆116Updated 7 months ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated 2 years ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆181Updated 6 years ago
- Neural end-to-end Speech Translation Toolkit☆309Updated 3 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆224Updated last year
- A module for normalising text.☆173Updated 4 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆123Updated 6 months ago
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆54Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆287Updated 2 months ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆153Updated 5 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- A unified versatile interface for dialogue datasets☆18Updated 2 years ago
- ☆16Updated 4 years ago
- wav2vec2 asr with transformers☆16Updated 4 years ago
- A python package for deep multilingual punctuation prediction.☆152Updated last year
- ☆105Updated 4 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆44Updated 2 years ago
- Mirror of SRILM☆57Updated 5 years ago
- ☆76Updated 4 years ago
- one script for xls-r/xlsr/whisper fine-tuning☆42Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Library for Textless Spoken Language Processing☆554Updated 2 years ago
- The case study and multilingfual performance of ICASSP submission☆24Updated 3 years ago
- Awesome TTS☆61Updated 4 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆157Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.☆56Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago