brandonrobertz/sentence-autosegmentation

brandonrobertz / sentence-autosegmentation

Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation

☆37

Alternatives and similar repositories for sentence-autosegmentation

Users who are interested in sentence-autosegmentation are comparing it to the libraries listed below.

notAI-tech / deepsegment
A sentence segmenter that actually works!
☆304 · Aug 18, 2020 · Updated 5 years ago
ldulcic / text-segmentation
Unsupervised text segmentation based on Latent Dirichlet Allocation and Topic Tiling
☆24 · Aug 6, 2016 · Updated 9 years ago
Yoctol / text-normalizer
Normalize text string
☆12 · Nov 6, 2018 · Updated 7 years ago
searchableai / ChainCQG
☆13 · Feb 11, 2021 · Updated 5 years ago
andreaferretti / charade
A server for multilanguage, composable NLP API in Python
☆28 · Dec 8, 2022 · Updated 3 years ago
ottokart / sequence-labeler
Neural network sequence labeling model - some sloppy modifications to the original toolkit to enable punctuation restoration in unsegment…
☆10 · Jan 8, 2017 · Updated 9 years ago
ottokart / punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
☆683 · Sep 19, 2021 · Updated 4 years ago
statedecoded / law-identifier
A collection of regular expressions to identify references to state laws.
☆19 · Sep 28, 2015 · Updated 10 years ago
lsvih / AtTGen
Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023
☆13 · May 19, 2023 · Updated 3 years ago
readbeyond / lachesis
lachesis automates the segmentation of a transcript into closed captions
☆35 · Jan 26, 2017 · Updated 9 years ago
Orange-OpenSource / COQAR
a corpus containing 4.5K conversations from the Conversational Question-Answering dataset CoQA, for a total of 53K follow-up question-ans…
☆16 · Jun 12, 2023 · Updated 3 years ago
MehwishFatimah / GPT2_Summarization
Finetune GPT2 for text summarization
☆17 · Aug 16, 2021 · Updated 4 years ago
18F / linkify-citations
Turns legal citations in the DOM into links
☆20 · Mar 15, 2017 · Updated 9 years ago
kemitchell / wordy-words
list of English words with shorter synonyms
☆24 · Apr 6, 2026 · Updated 3 months ago
danijel3 / SparrowhawkTest
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14 · Oct 16, 2017 · Updated 8 years ago
prdwb / attentive_history_selection
☆33 · May 10, 2021 · Updated 5 years ago
ncbi-nlp / ezTag
Web interface that allows users to perform computer-assisted text annotation
☆14 · Jan 19, 2023 · Updated 3 years ago
geyang / deep-auto-punctuation
a pytorch implementation of auto-punctuation learned character by character
☆139 · Nov 15, 2020 · Updated 5 years ago
mayhewsw / pytorch-truecaser
A simple neural truecaser written in pytorch and allennlp.
☆35 · Jun 17, 2024 · Updated 2 years ago
konklone / oversight.garden
Bringing together the oversight community's work.
☆26 · May 3, 2020 · Updated 6 years ago
yeontaek / XLNET-Korean-Model
☆19 · Mar 31, 2020 · Updated 6 years ago
ajupton / PySpark_guides
Reference and learning notebooks on the use of Spark for ML and analytical applications
☆12 · Mar 1, 2019 · Updated 7 years ago
hslh / pie-detection
Automatic Detection of Potentially Idiomatic Expressions
☆12 · Feb 19, 2021 · Updated 5 years ago
patyork / AutomaticSpeechChunker
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17 · May 15, 2015 · Updated 11 years ago
anoperson / DeepIE
An Information Extraction Framework with Deep Learning developed at New York University
☆15 · Oct 27, 2016 · Updated 9 years ago
srvk / lm_build
Adapting your own Language Model for Kaldi
☆63 · Jan 8, 2019 · Updated 7 years ago
yuboona / punctuation-restoration-pytorch
A PyTorch-based LSTM punctuation restoration implementation / a simple tutorial for learning PyTorch and NLP
☆24 · Jan 11, 2021 · Updated 5 years ago
PlagueHO / LoopbackAdapter
A PowerShell module for creating and removing Loopback Network Adapters on Windows using Device Console (DevCon.exe)
☆15 · Feb 26, 2021 · Updated 5 years ago
CLUEbenchmark / KGQA
Knowledge Graph based Question Answering benchmark.
☆10 · Feb 1, 2020 · Updated 6 years ago
ottokart / punctuator
An LSTM RNN for restoring missing punctuation in unsegmented text.
☆78 · Sep 24, 2016 · Updated 9 years ago
vackosar / keras-punctuator
Experimental project to punctuate text using an embedding layer, a single convolutional layer, and an output softmax layer, written in Keras.
☆83 · Oct 9, 2020 · Updated 5 years ago
ChenhaoJiang / LeetCode-Solution
My solution in Python for the problem of LeetCode
☆11 · Aug 13, 2020 · Updated 5 years ago
gooofy / kaldi-adapt-lm
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33 · Jan 26, 2020 · Updated 6 years ago
ltgoslo / factorizer
☆16 · May 14, 2024 · Updated 2 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
bellbind / python-simplexquery
simple native XQuery processing module using xqilla.
☆11 · Mar 24, 2012 · Updated 14 years ago
teddysum / korean_evaluation
☆10 · Jun 5, 2025 · Updated last year
sion-zcfei / CQG
The code for the ACL 2022 paper “CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question Generation”
☆23 · Oct 23, 2022 · Updated 3 years ago
iamvishnuks / Audio2Spectrogram
This tool can be used to convert mp3 files to processable wav files, generate chunks of wav files, and generate spectrograms.
☆30 · Feb 4, 2018 · Updated 8 years ago