This is a repo containing all code and steps taken to download, setup the process and convert the whole English Wikipedia history from Wikitext to HTML format.
☆14Jun 8, 2020Updated 5 years ago
Alternatives and similar repositories for WikiHist.html
Users that are interested in WikiHist.html are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utility to compute number of mandates based on election results, uting D'Hondt method☆11Sep 6, 2013Updated 12 years ago
- Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only☆13Sep 15, 2020Updated 5 years ago
- PyMix - The Python mixture package☆16Nov 9, 2015Updated 10 years ago
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Oct 7, 2020Updated 5 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Mar 14, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps☆11Apr 18, 2019Updated 6 years ago
- Hackbright Capstone Project☆11Apr 14, 2016Updated 9 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- code for "Determining Gains Acquired from Word Embedding Quantitatively Using Discrete Distribution Clustering" ACL 2017☆21Nov 21, 2018Updated 7 years ago
- texrex web page cleaning & ClaraX random walk crawler☆11Dec 13, 2021Updated 4 years ago
- Data on Paid Parental Leave Policies at US and Canadian Universities 2018☆17Jun 16, 2023Updated 2 years ago
- Pypi Fetcher for Nix with simplified interface. (contains hashes for all packages)☆15Nov 7, 2023Updated 2 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆17Jan 18, 2021Updated 5 years ago
- ☆11Aug 15, 2025Updated 7 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆15Nov 5, 2020Updated 5 years ago
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Jan 13, 2016Updated 10 years ago
- Interactive Network Graph Visualization for NDTV-generate graphs using D3 animation☆18Oct 2, 2015Updated 10 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Python code for training models in the ACL paper, "Simple and Effective Paraphrastic Similarity from Parallel Translations".☆22Oct 3, 2019Updated 6 years ago
- DEPRECATED REPO: SEE https://gitlab.wikimedia.org/kevinpayravi/cite-unseen☆16Sep 17, 2025Updated 6 months ago
- ☆19May 24, 2019Updated 6 years ago
- community site☆15Oct 25, 2018Updated 7 years ago
- Repository of data on web domains.☆19May 24, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Multilingual NLP annotation projection☆52May 20, 2022Updated 3 years ago
- ☆27Oct 22, 2012Updated 13 years ago
- ☆16Nov 6, 2016Updated 9 years ago
- Poetry Annotated with Rhyme Schemes☆25Nov 22, 2011Updated 14 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Nov 2, 2023Updated 2 years ago
- With cmsplugin-contact-plus building custom forms for your django-cms project is a breeze. Now it's so easy to build the forms with exact…☆29Feb 7, 2022Updated 4 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 9 months ago
- LoRa Basics Modem integration in Zephyr OS☆19Dec 20, 2024Updated last year
- implement hypergraphs on D3 force layout☆26Mar 20, 2018Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Engine for Warlight AI Challenge 2☆18Jun 26, 2015Updated 10 years ago
- ZS4IE: A Toolkit for Zero-Shot Information Extraction with Simple Verbalizations☆29Mar 28, 2022Updated 3 years ago
- ☆26Apr 22, 2022Updated 3 years ago
- Bipartite Configuration Model for Python☆16Dec 10, 2020Updated 5 years ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated 9 months ago
- A modern web app for Sudoku inspired by Cracking the Cryptic☆23Feb 6, 2026Updated last month
- Scripts for preprocessing the CoNLL-2005 SRL dataset.☆24Mar 28, 2019Updated 6 years ago