Convert Wikipedia database dumps into plaintext files
☆326May 23, 2021Updated 4 years ago
Alternatives and similar repositories for PlainTextWikipedia
Users that are interested in PlainTextWikipedia are comparing it to the libraries listed below
Sorting:
- ☆15Mar 11, 2024Updated last year
- Converts sound files to mp4s for twitter upload☆17Oct 31, 2018Updated 7 years ago
- Finds packages that require updates on a python environment☆22Updated this week
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆169Nov 7, 2022Updated 3 years ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆31Nov 4, 2023Updated 2 years ago
- JavaScript Sequence Alignment Viewer☆11Mar 25, 2022Updated 3 years ago
- Rest admin-like endpoints for django☆13Oct 3, 2016Updated 9 years ago
- CLIP OS Manifest☆11Nov 4, 2020Updated 5 years ago
- Tool for cleaning old and redundant backups☆14Dec 26, 2025Updated 2 months ago
- ☆11Nov 16, 2022Updated 3 years ago
- Rob Pike's simple regex matcher converted to Go☆11Aug 14, 2022Updated 3 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Jan 15, 2022Updated 4 years ago
- ☆70Nov 30, 2022Updated 3 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- Interpretable feature construction from taxonomies for text classification☆18Apr 4, 2022Updated 3 years ago
- A simple no-fuss selfhostable password generator.☆11Nov 10, 2020Updated 5 years ago
- Little silly chrome extension - Launch The Sims' Buy Mode music when entering Amazon☆11Jul 29, 2019Updated 6 years ago
- ☆13Nov 11, 2023Updated 2 years ago
- Pure CSS indentation lines for Hacker News with two-second installation via uBlock Origin☆11Jan 31, 2021Updated 5 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Jul 23, 2023Updated 2 years ago
- Add website scraping abilities to Datasette☆66Mar 4, 2023Updated 2 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification☆29Jan 23, 2025Updated last year
- 📈 All data from my life — location, health, work, play, and more — open sourced☆14Jul 5, 2022Updated 3 years ago
- ☆10Jun 26, 2021Updated 4 years ago
- Use Markov chain generators in Tracery/cheapbotsdonequick bots☆17Jul 6, 2018Updated 7 years ago
- ELECTRA MODEL NLP☆13Apr 8, 2020Updated 5 years ago
- Tool for running transformations on columns in a SQLite database☆31Aug 2, 2021Updated 4 years ago
- Python remake of the Commodore PET Star Trek BASIC game from 1977☆13May 12, 2022Updated 3 years ago
- codebase for the Text-based NP Enrichment (TNE) paper☆19Mar 12, 2024Updated last year
- ☆12Sep 5, 2022Updated 3 years ago
- A simple way to load Django-like fixtures into the datastore.☆15Jan 26, 2017Updated 9 years ago
- ☆17Mar 29, 2022Updated 3 years ago
- hackernews data☆34Dec 14, 2025Updated 2 months ago
- Export your (or other people's) Goodreads data to SQLite☆90Aug 27, 2020Updated 5 years ago
- Command line tool to write to x86 boot flash chips via the PCH☆14Mar 30, 2017Updated 8 years ago
- Datasette plugin for streaming SQLite database backups to S3, using Litestream!☆19Jan 20, 2026Updated last month
- ☆14Jan 7, 2024Updated 2 years ago
- Comparateur de scénarios énergétiques☆16Sep 12, 2024Updated last year
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year