kiasar / gutenberg_cleanerLinks
a python package for cleaning Gutenberg books and dataset
☆34Updated last month
Alternatives and similar repositories for gutenberg_cleaner
Users that are interested in gutenberg_cleaner are comparing it to the libraries listed below
Sorting:
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Analysis of gutenberg dataset☆44Updated 6 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 5 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- ☆64Updated 2 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings