This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wikitext to HTML format.
☆14Jun 8, 2020Updated 6 years ago
Alternatives and similar repositories for WikiHist.html
Users that are interested in WikiHist.html are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python tool to pull the complete edit history of a Wikipedia page☆21Apr 21, 2026Updated last month
- Utility to compute number of mandates based on election results, uting D'Hondt method☆11Sep 6, 2013Updated 12 years ago
- Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only☆13Sep 15, 2020Updated 5 years ago
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Oct 7, 2020Updated 5 years ago
- Hackbright Capstone Project☆11Apr 14, 2016Updated 10 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆18Jan 18, 2021Updated 5 years ago
- A curated list of awesome tools, frameworks, and resources for web research data collection, including tools that support observational m…☆23Dec 16, 2025Updated 6 months ago
- ☆15Nov 5, 2020Updated 5 years ago
- DEPRECATED REPO: SEE https://gitlab.wikimedia.org/kevinpayravi/cite-unseen☆16Sep 17, 2025Updated 9 months ago
- Language experimentation tools to accompany the SALT dataset☆15Jun 8, 2026Updated last week
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- A piecewise affine image warper for python 2 or 3.☆26Jun 26, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- community site☆14Oct 25, 2018Updated 7 years ago
- Repository of data on web domains.☆19May 24, 2023Updated 3 years ago
- Multilingual NLP annotation projection☆53May 20, 2022Updated 4 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆26Nov 4, 2022Updated 3 years ago
- Example code producing novelty, transience, and resonance for a sample of legislative speech during the French Revolution.☆17Oct 21, 2025Updated 7 months ago
- Python package aiding in entity disambiguation based on string and location matching☆18Nov 2, 2023Updated 2 years ago
- With cmsplugin-contact-plus building custom forms for your django-cms project is a breeze. Now it's so easy to build the forms with exact…☆29Feb 7, 2022Updated 4 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated last year
- LoRa Basics Modem integration in Zephyr OS☆20Dec 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data exploration done quick.☆19Jul 22, 2021Updated 4 years ago
- ☆26Apr 22, 2022Updated 4 years ago
- Bipartite Configuration Model for Python☆16Dec 10, 2020Updated 5 years ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated last year
- Build a Medical Q&A system using LangChain and Mistral 7B☆18Jan 4, 2024Updated 2 years ago
- ☆16Feb 26, 2020Updated 6 years ago
- scikit-learn like interface to chainer☆22Mar 8, 2016Updated 10 years ago
- Create a webmanifest file☆19Aug 9, 2020Updated 5 years ago
- A modern web app for Sudoku inspired by Cracking the Cryptic☆24Feb 6, 2026Updated 4 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Acoustic distance measure for comparing pronunciations☆17Aug 2, 2022Updated 3 years ago
- Wikidata lexemes presentations☆23Jan 30, 2026Updated 4 months ago
- Language-Agnostic Website Embedding and Classification☆48Jan 22, 2024Updated 2 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Sep 22, 2020Updated 5 years ago
- Bangla Unicode Normalization☆23May 26, 2024Updated 2 years ago
- Wikipedia DB Dump Server + wikitext parser in Go/Golang☆15Jun 25, 2019Updated 6 years ago
- Translate plain ASCII quotation marks and other characters into “smart” typographic HTML entities.☆44Jun 17, 2025Updated last year