lehinevych / MediaWikiAPILinks
Python wrapper for the MediaWiki API to access and parse data from Wikipedia
β41Updated last week
Alternatives and similar repositories for MediaWikiAPI
Users that are interested in MediaWikiAPI are comparing it to the libraries listed below
Sorting:
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.β41Updated 5 years ago
- Next-generation Punkt sentence boundary detection with zero dependenciesβ17Updated 2 months ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graphβ37Updated 11 months ago
- MediaWiki API wrapper in python http://pymediawiki.readthedocs.io/en/latest/β184Updated 5 months ago
- an experimental implementation of Burrow's delta in Python 3β21Updated 3 years ago
- Finds linguistic patterns effortlesslyβ36Updated last year
- A python package to simulate typographical errors.β35Updated last year
- A compound word splitter for Pythonβ48Updated 3 years ago
- Generate reports for spaCy models.β29Updated 3 years ago
- Generate a SQLite database from Wikipedia & Wikidata dumps.β35Updated last year
- A lightweight python library for working with Akoma Ntoso documents.β17Updated 2 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any otheβ¦β68Updated 2 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidataβ94Updated 2 years ago
- Alternative robots parser module for Pythonβ18Updated last week
- Binary Python bindings for poppler utils for content extractionβ42Updated 4 years ago
- πΈ Train floret vectorsβ18Updated 2 years ago
- A Python implementation of Lunr.js πβ197Updated 3 months ago
- A helper library full of URL-related heuristics.β69Updated 2 weeks ago
- Sort-friendly URI Reordering Transform (SURT) python moduleβ42Updated 10 months ago
- python functions for applied use of schema.orgβ36Updated 3 years ago
- β23Updated last year
- Python API to query a SPARQL endpointβ32Updated 3 years ago
- Parse numbers written in natural languageβ117Updated 8 months ago
- Extract networks of entities from journalistic reportingβ48Updated last year
- π Additional lookup tables and data resources for spaCyβ105Updated 3 weeks ago
- spaCy extension for Visual Studio Codeβ32Updated 3 months ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearchβ70Updated 3 years ago
- ποΈ Radically lightweight command-line interfacesβ107Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (incluβ¦β64Updated last year