raduangelescu / gutenbergpy
Gutenberg cache and query library
☆37 · Updated 9 months ago
Alternatives and similar repositories for gutenbergpy
Users who are interested in gutenbergpy are comparing it to the libraries listed below.
- Poetic processing, for Python. ☆40 · Updated last year
- A tool for analyzing the word histories of a text. ☆34 · Updated 5 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 · Updated 3 years ago
- Multilingual syllable annotation pipeline component for spacy ☆39 · Updated 2 years ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆63 · Updated last month
- An experimental implementation of Burrows' Delta in Python 3 ☆21 · Updated 3 years ago
- WordWanderer – take your text for a walk ☆12 · Updated 6 years ago
- ☆30 · Updated 8 years ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts. ☆15 · Updated 2 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp… ☆29 · Updated 3 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆20 · Updated this week
- A simple interface to the Project Gutenberg corpus. ☆327 · Updated 2 years ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB). ☆16 · Updated this week
- Decompose, transform, and recombine prose into mutated forms. ☆12 · Updated 8 months ago
- Bulk downloader for free ebooks hosted at Project Gutenberg ☆19 · Updated 3 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment ☆29 · Updated last month
- ☆54 · Updated last year
- Inspect a URL and estimate if it contains a news story ☆39 · Updated 5 months ago
- eXtensible Interlinear Glossed Text ☆33 · Updated 3 years ago
- A cloud-based, open-source system for writing and publishing dictionaries. ☆91 · Updated last year
- The official repository for the Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels. ☆39 · Updated last year
- Python API to access glottolog/glottolog ☆29 · Updated 6 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 · Updated last year
- A web framework to display Cross Linguistic Linked Data. ☆57 · Updated 3 months ago
- Interactive visualization of Wiktionary words and etymologies. ☆92 · Updated 3 months ago
- Thoughts toward and tutorial on corpus-driven narrative generation ☆24 · Updated 4 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆41 · Updated last year
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty… ☆99 · Updated 11 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆13 · Updated last year
- English Resource Grammar ☆21 · Updated this week