LunarWatcher / se-data-dump-transformerLinks
Part of a community-driven effort to counteract Stack Exchange's anti-community data dump changes
☆26Updated last month
Alternatives and similar repositories for se-data-dump-transformer
Users that are interested in se-data-dump-transformer are comparing it to the libraries listed below
Sorting:
- Citation bot is a tool to expand and format references at Wikipedia. It retrieves citation data from a variety of sources including Cross…☆67Updated this week
- A Python library to parse MediaWiki WikiText☆317Updated 8 months ago
- Collection of userscripts that are used by/are useful to Charcoal.☆30Updated 3 months ago
- AutoWikiBrowser from the new SF repo☆24Updated last month
- ☆84Updated this week
- A description of the UMP data format used by YouTube☆74Updated last year
- This is a mirror from https://gerrit.wikimedia.org/g/mediawiki/services/parsoid/. See https://www.mediawiki.org/wiki/Developer_access for…☆170Updated this week
- The English Wikipedia twinkle javascript helper☆155Updated this week
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆107Updated 2 months ago
- Language data and utilities☆18Updated 3 weeks ago
- A Wikipedia user script to make replying to comments easier.☆11Updated 4 years ago
- A Python parser for MediaWiki wikicode☆858Updated 7 months ago
- Template repository for creating a localised version of Twinkle☆11Updated last year
- A tool for reviewing Articles for Creation submissions on the English Wikipedia☆43Updated last month
- Library Card Platform for The Wikipedia Library☆90Updated last week
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆43Updated last month
- TypeScript definitions for MediaWiki JS interface☆29Updated last month
- Wikitionary in accessible JSON format☆35Updated 3 years ago
- ☆150Updated 2 weeks ago
- Userscripts for Stack Exchange☆33Updated 2 months ago
- Streaming WARC/ARC library for fast web archive IO☆446Updated last year
- This packages up data for the Open Multilingual Wordnet☆60Updated last week
- Stand-alone WordNet API☆54Updated 3 years ago
- English Dictionaries Project☆179Updated last week
- A suite of tools to analyze page, user and project data of MediaWiki websites☆138Updated this week
- WikiLoop DoubleCheck: a web tool to help review Wikipedia edits easily and collaboratively.☆82Updated last year
- Stack Overflow Moderation Userscripts☆127Updated 7 months ago
- Code to create a database with cleaned up Wiktionary data and then to create ebook dictionaries based on this data.☆32Updated 2 years ago
- Mirror of https://gerrit.wikimedia.org/g/mediawiki/gadgets/RTRC.☆27Updated last year
- Chinese language dictionary with lots of useful information like frequency ranks and percentile (Junda, Giga), Unihan metadata (stroke co…☆19Updated 7 years ago