dohliam / more-stoplistsLinks
stoplists for African languages generated from the ASP corpus
☆14Updated 9 years ago
Alternatives and similar repositories for more-stoplists
Users that are interested in more-stoplists are comparing it to the libraries listed below
Sorting:
- command-line tool to extract taxonomies from Wikidata☆128Updated 6 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Web hub based on Wikidata☆37Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 7 months ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io☆79Updated 2 years ago
- This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections.☆18Updated 10 years ago
- Wikidata embedding☆51Updated 9 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Github mirror of "wikidata/query/gui" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆80Updated 6 months ago
- Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. …☆60Updated 7 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Updated 3 years ago
- Website content for annotatorjs.org☆16Updated 4 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆93Updated last year
- An online annotation platform for teaching and learning in the humanities.☆108Updated last week
- read and edit a Wikibase instance from the command line☆235Updated 3 months ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago
- Convert between DOM Range instances and text quotes.☆34Updated 2 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆96Updated 3 years ago
- A visual timeline authoring tool that extracts temporal information from freeform text☆65Updated 2 years ago