A no-nonsense web scraping tool which removes the crap and preserves the content in epub and pdf formats.
☆41 · Jan 10, 2016 · Updated 10 years ago
Alternatives and similar repositories for CleanScrape
Users that are interested in CleanScrape are comparing it to the libraries listed below
- "Save as DAISY" add-in for Microsoft Word ☆10 · Dec 22, 2025 · Updated 2 months ago
- Revised shell script for parsing .gnmap, .xml, or .nmap port scan results files to a CSV list, lists of IPs per port, web urls, and a sum… ☆13 · Apr 17, 2020 · Updated 5 years ago
- Collection of somewhat useful stuff for CTF events ☆36 · Jun 24, 2015 · Updated 10 years ago
- A framework, data and configs for generating and building Tesseract OCR lang.traineddata model files, specifically for Japanese ☆10 · Dec 9, 2013 · Updated 12 years ago
- Binary Analysis Platform ☆74 · Oct 21, 2013 · Updated 12 years ago
- Mobbex payment gateway plugin for WooCommerce. ☆12 · Feb 12, 2026 · Updated 2 weeks ago
- MasTKO is a security tool which detects DNS entries associated with AWS’s EC2 servers susceptible to takeover attack and attempts a takeo… ☆11 · Jun 14, 2023 · Updated 2 years ago
- This is the active development repository for DIAGRAM accessible Interactives. Once a project gets fleshed out and is stable a copy of … ☆10 · Sep 17, 2018 · Updated 7 years ago
- Grecka is a python script to convert Greek to Greeklish based on ELOT 743 ☆12 · Aug 4, 2018 · Updated 7 years ago
- Redis tcp map for postfix ☆12 · Jun 28, 2024 · Updated last year
- ☆12 · Aug 24, 2014 · Updated 11 years ago
- Speech ANDroid Apps ☆20 · Jan 22, 2014 · Updated 12 years ago
- (Labeled) Latent Dirichlet Allocation on a sentence level with Gibbs Sampling ☆10 · Mar 27, 2014 · Updated 11 years ago
- ☆12 · Oct 1, 2024 · Updated last year
- Madek main web interface ☆21 · Feb 26, 2026 · Updated last week
- A GitHub Action to deploy to Dropbox ☆11 · Jul 12, 2023 · Updated 2 years ago
- Constraint solver based on abstract interpretation ☆10 · Dec 20, 2024 · Updated last year
- Curso de MongoDB, plan para EducacionIT. Powered by Cesar Casas ☆11 · Aug 17, 2018 · Updated 7 years ago
- Focused Crawler for VT's CTRNet ☆10 · May 13, 2013 · Updated 12 years ago
- R package for working with data stored within VERIS framework ☆13 · Dec 22, 2015 · Updated 10 years ago
- A Graph Rewriting Tool for Plot Generation, uses Graph Grammars ☆11 · Mar 3, 2014 · Updated 12 years ago
- C++ FreeVerb implementation in STK ☆15 · Apr 22, 2012 · Updated 13 years ago
- Arch Linux package for the Linux Kernel and modules with grsecurity/PaX patches. ☆20 · Apr 26, 2017 · Updated 8 years ago
- Adium plugin for Tox IM protocol ☆14 · Sep 6, 2014 · Updated 11 years ago
- Wiegand data logger, replay device and micro door-controller ☆14 · Jan 5, 2024 · Updated 2 years ago
- This project demonstrates how to transcribe audio from your media. With this tool, you can then tidy up the transcribed text into bite-si… ☆10 · Feb 12, 2019 · Updated 7 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆13 · Aug 10, 2023 · Updated 2 years ago
- Over-engineered tool for symlinking dotfiles ☆37 · Nov 13, 2013 · Updated 12 years ago
- Super efficient TCP connection between remote processes ☆12 · Apr 7, 2016 · Updated 9 years ago
- An attempt to run a MirageOS unikernel, built with Solo5, running in Qemu, on a Raspberry Pi 3 ☆11 · Mar 12, 2016 · Updated 9 years ago
- Elasticsearch REPL built on top of Jest ☆23 · May 12, 2015 · Updated 10 years ago
- Statistical WHOIS parser ☆10 · Apr 17, 2017 · Updated 8 years ago
- Grapheme to phoneme converter for Estonian ☆14 · May 27, 2021 · Updated 4 years ago
- Slack post-exploitation script for leaked bot tokens and "d" cookies ☆17 · Nov 18, 2025 · Updated 3 months ago
- A sensible way to build covers ☆12 · Jan 30, 2019 · Updated 7 years ago
- ☆10 · May 17, 2024 · Updated last year
- Node interface which parses sentences into grammatical structures ☆12 · May 31, 2017 · Updated 8 years ago
- A probabilistic CKY parser for PCFGs ☆19 · Mar 12, 2014 · Updated 11 years ago
- WebSocket server implementation of OCaml ☆14 · May 14, 2019 · Updated 6 years ago