mathisve / LatinTextDataset
Latin text dataset for machine learning and procedural text generation
☆15Updated 7 months ago
Alternatives and similar repositories for LatinTextDataset:
Users that are interested in LatinTextDataset are comparing it to the libraries listed below
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Morphological analyzer and lemmatizer for Latin.☆25Updated 2 months ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated last year
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- a python package for cleaning Gutenberg books and dataset☆32Updated last year
- In-browser OCR of Ancient Greek and Latin☆25Updated 2 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- A tool for analyzing the word histories of a text.☆34Updated last month
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- Named entity annotation tool☆27Updated last year
- The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to …☆30Updated 2 months ago
- Python library for automatic analysis of Ancient Greek hexameter. The algorithm uses linguistic rules and finite-state technology.☆20Updated 11 months ago
- Planning Seminar and 2016-2017 WS and SS Courses☆10Updated 5 years ago
- Latin BERT☆58Updated 6 months ago
- Scripts for scraping metadata from Project Gutenberg books, via GITenberg.☆19Updated 6 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆10Updated 2 years ago
- Tutorials for the CLTK☆52Updated 4 years ago
- LexInfo - Data Category Ontology for OntoLex-Lemon☆22Updated last year
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 8 months ago
- A command-line program to download text corpora.☆33Updated 7 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Wikidata embedding☆51Updated 2 months ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆69Updated 3 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago
- Language models are open knowledge graphs ( non official implementation )☆13Updated 4 years ago
- TEI Reader Python Library☆17Updated last year
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated 7 months ago