gambolputty / textstelleLinks
Textstelle is a collection of corpora for the creation of bots and other things that generate text 🤖
☆20Updated 3 years ago
Alternatives and similar repositories for textstelle
Users that are interested in textstelle are comparing it to the libraries listed below
Sorting:
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 2 years ago
- Documents for the project Libraccess☆13Updated 10 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Neo4j powered web application for multimedia collections: bring graph-based exploration and crowd-based indexation.☆39Updated 5 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Metadata from Project Gutenberg☆41Updated last week
- Prototype SOLR-powered web archive exploration UI.☆43Updated 5 years ago
- Highlighting various OCR formats directly in Solr☆86Updated 2 weeks ago
- This repository has migrated to:☆100Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- API implementation, User Interface, and more modules of the IPTC EXTRA project☆13Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.☆55Updated 2 months ago
- OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.☆120Updated last month
- Example SPARQL queries, mostly for working with ZBW data sets☆16Updated 10 months ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- A tool to extract canonical references from text.☆20Updated 4 years ago
- Command line OAI-PMH harvester and client with built-in cache.☆126Updated this week
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Python 3 library for processing historical English☆67Updated 11 months ago
- A tool to analyse, browse and query Wikidata☆83Updated 2 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated last month
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- A structured list of text corpora, created for use with a corpus downloader.☆13Updated 8 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated last year
- Web application for management formal representations of knowledge, like controlled vocabularies, taxonomies, thesauri and glossaries☆133Updated last week
- German stopwords collection☆86Updated 2 years ago