This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wikitext to HTML format.
☆14Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for WikiHist.html
Users that are interested in WikiHist.html are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Discovery of Rhyme Schemes in Poetry☆17Nov 22, 2011Updated 14 years ago
- Utility to compute number of mandates based on election results, uting D'Hondt method☆11Sep 6, 2013Updated 12 years ago
- Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only☆13Sep 15, 2020Updated 5 years ago
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Oct 7, 2020Updated 5 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Mar 14, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- Pypi Fetcher for Nix with simplified interface. (contains hashes for all packages)☆15Nov 7, 2023Updated 2 years ago
- A python interface for the CIViC db application☆12Apr 8, 2026Updated last week
- ☆15Nov 5, 2020Updated 5 years ago
- Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation☆18Oct 2, 2015Updated 10 years ago
- DEPRECATED REPO: SEE https://gitlab.wikimedia.org/kevinpayravi/cite-unseen☆16Sep 17, 2025Updated 7 months ago
- Language experimentation tools to accompany the SALT dataset☆15Apr 4, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- A prototype research tool to demonstrate how metadata and automated analysis can be combined.☆14Dec 3, 2022Updated 3 years ago
- An index data structure for approximate string search.☆23May 6, 2019Updated 6 years ago
- Safe serialization of ML models☆18Apr 21, 2023Updated 2 years ago
- ☆11Feb 8, 2022Updated 4 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆28Feb 8, 2023Updated 3 years ago
- A piecewise affine image warper for python 2 or 3.☆26Jun 26, 2016Updated 9 years ago
- ☆19May 24, 2019Updated 6 years ago
- community site☆14Oct 25, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multilingual NLP annotation projection☆52May 20, 2022Updated 3 years ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆25Nov 4, 2022Updated 3 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Nov 6, 2020Updated 5 years ago
- With cmsplugin-contact-plus building custom forms for your django-cms project is a breeze. Now it's so easy to build the forms with exact…☆29Feb 7, 2022Updated 4 years ago
- LoRa Basics Modem integration in Zephyr OS☆20Dec 20, 2024Updated last year
- Data exploration done quick.☆19Jul 22, 2021Updated 4 years ago
- SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval☆50Sep 20, 2022Updated 3 years ago
- Engine for Warlight AI Challenge 2☆18Jun 26, 2015Updated 10 years ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- A super-repository of MediaWiki extensions not hosted at Wikimedia.☆25Updated this week
- This tutorial accompanies the NSF-CBMS Conference and Software Day on Topological Methods in Machine Learning and Artificial Intelligence…☆21May 18, 2019Updated 6 years ago
- simple python interface to SMAC.☆21Mar 27, 2018Updated 8 years ago
- Python tools to scrape, load and manage campaign finance data housed on the Federal Election Commission website☆24Nov 14, 2018Updated 7 years ago
- A small python package to flexibly convert from betacode to unicode and back.☆20Jun 22, 2023Updated 2 years ago
- ☆25Jan 22, 2024Updated 2 years ago