Riamse / ceterach
An interface for interacting with MediaWiki
☆37Updated 3 years ago
Alternatives and similar repositories for ceterach:
Users that are interested in ceterach are comparing it to the libraries listed below
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆44Updated 7 years ago
- CLI tool for importing entities from Wikidata / Wikibase☆23Updated 2 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- Python package for working with MediaWiki wikis☆105Updated 3 years ago
- Ingestors extract the contents of mixed unstructured documents into structured (followthemoney) data.☆59Updated 2 weeks ago
- A set of utilities for accessing and processing MediaWiki data.☆55Updated 6 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more☆22Updated 8 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆60Updated 2 weeks ago
- Link Wikidata items to large catalogs☆96Updated 3 weeks ago
- WARC and ARC indexing and discovery tools.☆122Updated 2 weeks ago
- export data from twitter archive and visualize it☆25Updated 2 years ago
- Code for my Wikimedia Labs Tools account☆93Updated 7 months ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆38Updated 11 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Updated 8 years ago
- A javascript tool to visualize the diff's in wikipedia☆35Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 5 years ago
- Scripts for Wikidata☆20Updated this week
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Command line interface to Wikidata Query Service☆55Updated 11 months ago
- Simple Python Wrapper around MediaWiki API☆30Updated 2 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆149Updated 2 months ago
- Adding links to full text in Wikipedia references☆37Updated last year
- Specification of NAF, the NLP annotation format☆21Updated 4 years ago
- Trough: Big data, small databases.☆40Updated 8 months ago
- This is a REST Server endpoint built using Flask and Python.☆24Updated 2 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆115Updated 8 years ago
- Do you have a question? Ask Wikidata! This tool lets you enter a question and tries to parse it. If it understands what you want to know,…☆46Updated 9 years ago