Riamse / ceterach
An interface for interacting with MediaWiki
☆37 Updated 3 years ago
Alternatives and similar repositories for ceterach:
Users who are interested in ceterach are comparing it to the libraries listed below.
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 7 years ago
- Simple Python Wrapper around MediaWiki API ☆30 Updated 2 years ago
- Trough: Big data, small databases. ☆41 Updated 9 months ago
- A JavaScript tool to visualize the diffs in Wikipedia ☆35 Updated 2 years ago
- A PDF classifier ensemble with REST API service ☆23 Updated 4 years ago
- Link Wikidata items to large catalogs ☆96 Updated last month
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆150 Updated 3 months ago
- Scripts for Wikidata ☆20 Updated 3 weeks ago
- "Old SFM" -- manage rules and streams from social data sources, starting with Twitter. ☆86 Updated last year
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery ☆57 Updated 9 months ago
- A set of utilities for accessing and processing MediaWiki data. ☆55 Updated 6 years ago
- GitHub mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing) ☆36 Updated 10 months ago
- Semanticizest: dump parser and client ☆20 Updated 8 years ago
- This is a REST Server endpoint built using Flask and Python. ☆24 Updated 2 years ago
- Sort-friendly URI Reordering Transform (SURT) Python module ☆42 Updated 8 months ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆115 Updated 8 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io ☆38 Updated 9 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆64 Updated 8 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆61 Updated last month
- API - extract a list of keywords from a text. ☆18 Updated 7 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more ☆22 Updated 8 years ago
- CLI tool for importing entities from Wikidata / Wikibase ☆23 Updated 2 years ago
- A Python client for the RDF web-services provided by Geonames (http://www.geonames.org). ☆22 Updated 9 years ago
- Traptor -- A distributed Twitter feed ☆26 Updated 2 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 Updated 8 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik ☆24 Updated 8 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 Updated 3 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 Updated 7 years ago
- GitHub mirror of "wikidata/query/blazegraph" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer… ☆15 Updated 5 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago