oliverguhr / fullstop-deep-punctuation-predictionLinks

A model that predicts the punctuation of English, Italian, French and German texts.

☆80

Alternatives and similar repositories for fullstop-deep-punctuation-prediction

Users that are interested in fullstop-deep-punctuation-prediction are comparing it to the libraries listed below

Sorting:

oliverguhr / deepmultilingualpunctuation
A python package for deep multilingual punctuation prediction.
☆128Updated 11 months ago
xashru / punctuation-restoration
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages
☆219Updated last year
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆116Updated 2 years ago
isi-nlp / uroman
Universal Romanizer that can convert any unicode script to roman (latin) script
☆214Updated last year
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆339Updated last year
revdotcom / speech-datasets
Various speech datasets made available to the public
☆126Updated 7 months ago
roedoejet / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆171Updated last month
Felflare / rpunct
📝An easy-to-use package to restore punctuation of the text.
☆117Updated 2 years ago
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆149Updated last year
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
cadia-lvl / punctuation-prediction
Support tools for punctuation and boundary detection for ASR output.
☆57Updated 2 years ago
rhasspy / gruut
A tokenizer, text cleaner, and phonemizer for many human languages.
☆321Updated 8 months ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆106Updated 5 months ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆60Updated last year
RuABraun / texterrors
☆37Updated 3 months ago
xinjli / transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆168Updated 2 years ago
feldberlin / timething
Timething is a library for aligning text transcripts with their audio recordings.
☆122Updated 8 months ago
ccoreilly / wav2vec2-service
☆38Updated 3 years ago
chrisspen / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆35Updated 4 years ago
ftyers / commonvoice-utils
Linguistic processing for Common Voice
☆57Updated last year
jimbozhang / speechocean762
A non-native English corpus for pronunciation scoring task
☆144Updated last year
Open-Speech-EkStep / indic-punct
☆43Updated 2 years ago
repodiac / german_transliterate
Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…
☆33Updated 4 years ago
alphacep / whisper-prompts
OpenAI Whisper Prompt Examples
☆52Updated 2 years ago
Open-Speech-EkStep / ULCA-asr-dataset-corpus
☆47Updated 2 years ago
tomaarsen / TTSTextNormalization
Convert English text from written expressions into spoken forms
☆25Updated 3 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆82Updated 2 years ago
spring-media / DeepPhonemizer
Grapheme to phoneme conversion with deep learning.
☆393Updated last year
notAI-tech / fastPunct
Punctuation restoration and spell correction experiments.
☆251Updated 4 years ago
YuanGongND / gopt
Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".
☆179Updated 2 years ago