epfl-dlab/WikiHist.html

This repo contains all the code and steps used to download the whole English Wikipedia history, set up the conversion process, and convert it from wikitext to HTML.

☆14

Alternatives and similar repositories for WikiHist.html

Users who are interested in WikiHist.html are comparing it to the repositories listed below.

hiroyuki-kasai / SSPW-kmeans
Sparse simplex projection-based Wasserstein k-means
☆11 · Jun 10, 2021 · Updated 5 years ago
ndrezn / wikipedia-histories
A Python tool to pull the complete edit history of a Wikipedia page
☆21 · Jul 19, 2026 · Updated last week
arteria / djangocms-inline-comment
Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only
☆13 · Sep 15, 2020 · Updated 5 years ago
neonbadger / DestinationUnknown
Hackbright Capstone Project
☆11 · Apr 14, 2016 · Updated 10 years ago
aaronclauset / parental-leave
Data on Paid Parental Leave Policies at US and Canadian Universities 2018
☆18 · Jun 16, 2023 · Updated 3 years ago
floriangeigl / arxiv_converter
A tiny python2.7 script which converts LaTex projects into arxiv-format. Suggestions are welcome.
☆10 · Mar 20, 2016 · Updated 10 years ago
KevinPayravi / CiteUnseen
DEPRECATED REPO: SEE https://gitlab.wikimedia.org/kevinpayravi/cite-unseen
☆16 · Sep 17, 2025 · Updated 10 months ago
sravanareddy / rhymediscovery
Discovery of Rhyme Schemes in Poetry
☆17 · Nov 22, 2011 · Updated 14 years ago
N-Almarwani / DCT_Sentence_Embedding
Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform
☆17 · Jul 2, 2020 · Updated 6 years ago
DavHau / nix-pypi-fetcher
Pypi Fetcher for Nix with simplified interface. (contains hashes for all packages)
☆15 · Nov 7, 2023 · Updated 2 years ago
jkbren / curvy-networkx-edges
arched links in networkx drawing
☆12 · Jul 19, 2019 · Updated 7 years ago
npmSteven / Unraid-VM-CP
☆11 · Updated this week
ValterH / automatic-positions-detection-and-scoring-in-jiu-jitsu
Implementation of "Video-Based Detection of Combat Positions and Automatic Scoring in Jiu-jitsu"
☆20 · Oct 23, 2024 · Updated last year
michalgm / ndtv-d3
Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation
☆18 · Oct 2, 2015 · Updated 10 years ago
google / wmt19-paraphrased-references
☆15 · Nov 5, 2020 · Updated 5 years ago
jwieting / simple-and-effective-paraphrastic-similarity
Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".
☆22 · Oct 3, 2019 · Updated 6 years ago
SunbirdAI / salt
Language experimentation tools to accompany the SALT dataset
☆15 · Updated this week
WebNLG / challenge-2020
Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020
☆13 · Feb 1, 2022 · Updated 4 years ago
psu-libraries / contentdmtools
PowerShell scripts for processing content into CONTENTdm load packages, batch editing, and batch re-ocr.
☆11 · Jun 2, 2023 · Updated 3 years ago
pratham16cse / DualTPP
Code for "Long Horizon Forecasting With Temporal Point Processes", WSDM 2021
☆21 · Feb 5, 2022 · Updated 4 years ago
zalandoresearch / zap
Multilingual NLP annotation projection
☆53 · May 20, 2022 · Updated 4 years ago
arteria / cmsplugin-contact-plus
With cmsplugin-contact-plus building custom forms for your django-cms project is a breeze. Now it's so easy to build the forms with exact…
☆29 · Feb 7, 2022 · Updated 4 years ago
verginer / disamby
Python package aiding in entity disambiguation based on string and location matching
☆18 · Nov 2, 2023 · Updated 2 years ago
TimSC / image-piecewise-affine
A piecewise affine image warper for python 2 or 3.
☆26 · Jun 26, 2016 · Updated 10 years ago
KnowledgeLab / wisdom-of-polarized-crowds
☆19 · May 24, 2019 · Updated 7 years ago
sravanareddy / rhymedata
Poetry Annotated with Rhyme Schemes
☆26 · Nov 22, 2011 · Updated 14 years ago
fnshr / trans-tidy-data
Japanese translation of Wickham (2014) "Tidy Data"
☆20 · Jan 9, 2017 · Updated 9 years ago
gitronald / domains
Repository of data on web domains.
☆19 · May 24, 2023 · Updated 3 years ago
DrorSh / openalex_to_gbq
☆18 · Feb 20, 2026 · Updated 5 months ago
davestephens / docker-enigma-bbs
Docker image for ENiGMA BBS software
☆17 · Aug 3, 2020 · Updated 5 years ago
alirezamshi-zz / small100
Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…
☆26 · Nov 4, 2022 · Updated 3 years ago
CodenamesAICompetition / Game
☆25 · Apr 22, 2022 · Updated 4 years ago
AndreaSimeone / d3-hypergraph
implement hypergraphs on D3 force layout
☆26 · Mar 20, 2018 · Updated 8 years ago
TurkuNLP / wikibert
BERT models for many languages created from Wikipedia texts
☆33 · May 25, 2020 · Updated 6 years ago
matgrioni / betacode
A small python package to flexibly convert from betacode to unicode and back.
☆20 · Jun 22, 2023 · Updated 3 years ago
CentreForDigitalHumanities / tscan
T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf
☆19 · May 28, 2025 · Updated last year
arananet / pi1541pcb
This is a PCB based on the Steve White schematic. You just plug on the top of a RPI 3 and you have a fully working 1541 disk emulator :)
☆18 · Apr 20, 2020 · Updated 6 years ago
vss-devel / zimmer
nodejs ZIM file creator
☆21 · Apr 16, 2021 · Updated 5 years ago
norakassner / mlama
☆25 · Jan 22, 2024 · Updated 2 years ago