This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wikitext to HTML format.
☆14Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for WikiHist.html
Users that are interested in WikiHist.html are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Discovery of Rhyme Schemes in Poetry☆17Nov 22, 2011Updated 14 years ago
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 7 years ago
- Hackbright Capstone Project☆11Apr 14, 2016Updated 10 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆18Jan 18, 2021Updated 5 years ago
- Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".☆22Oct 3, 2019Updated 6 years ago
- DEPRECATED REPO: SEE https://gitlab.wikimedia.org/kevinpayravi/cite-unseen☆16Sep 17, 2025Updated 7 months ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- A prototype research tool to demonstrate how metadata and automated analysis can be combined.☆14Dec 3, 2022Updated 3 years ago
- An index data structure for approximate string search.☆23May 6, 2019Updated 7 years ago
- Safe serialization of ML models☆18Apr 21, 2023Updated 3 years ago
- A piecewise affine image warper for python 2 or 3.☆26Jun 26, 2016Updated 9 years ago
- Multilingual NLP annotation projection☆53May 20, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆26Nov 4, 2022Updated 3 years ago
- ☆16Nov 6, 2016Updated 9 years ago
- Poetry Annotated with Rhyme Schemes☆25Nov 22, 2011Updated 14 years ago
- **Sferes2 module** A unifying modular framework for Quality-Diversity algorithms☆22Nov 6, 2020Updated 5 years ago
- Allows the use of BibTeX citations within a Pelican site☆25Apr 14, 2020Updated 6 years ago
- LoRa Basics Modem integration in Zephyr OS☆20Dec 20, 2024Updated last year
- ☆26Apr 22, 2022Updated 4 years ago
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- This tutorial accompanies the NSF-CBMS Conference and Software Day on Topological Methods in Machine Learning and Artificial Intelligence…☆22May 18, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- scikit-learn like interface to chainer☆22Mar 8, 2016Updated 10 years ago
- simple python interface to SMAC.☆21Mar 27, 2018Updated 8 years ago
- Python tools to scrape, load and manage campaign finance data housed on the Federal Election Commission website☆24Nov 14, 2018Updated 7 years ago
- ☆25Jan 22, 2024Updated 2 years ago
- Create a webmanifest file☆19Aug 9, 2020Updated 5 years ago
- A modern web app for Sudoku inspired by Cracking the Cryptic☆24Feb 6, 2026Updated 3 months ago
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆24Mar 28, 2019Updated 7 years ago
- Acoustic distance measure for comparing pronunciations☆17Aug 2, 2022Updated 3 years ago
- Wikidata lexemes presentations☆23Jan 30, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Language-Agnostic Website Embedding and Classification☆48Jan 22, 2024Updated 2 years ago
- PyDataLondonTutorial☆26May 5, 2016Updated 10 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Sep 22, 2020Updated 5 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆57Jan 18, 2023Updated 3 years ago
- ☆19Oct 6, 2020Updated 5 years ago
- Wikipedia DB Dump Server + wikitext parser in Go/Golang☆14Jun 25, 2019Updated 6 years ago
- Encode / decode varints.☆14May 24, 2021Updated 4 years ago