18F/scrapebox

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/18F/scrapebox)

18F / scrapebox

A simple, system independent infrastructure for performing web scraping. Utilizes Vagrant virtualbox interface and puppet provisioning to create and execute scraping of web content to structured data quickly and easily without modifying your core system.

☆24

Alternatives and similar repositories for scrapebox

Users that are interested in scrapebox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amazon-archives / tesserpy
View on GitHub
ARCHIVED: A Python API for Tesseract
☆20Jul 25, 2017Updated 8 years ago
timClicks / prequel
View on GitHub
Get your data into a database
☆19Oct 6, 2014Updated 11 years ago
18F / doc_processing_toolkit
View on GitHub
Python library to extract text from PDF, and default to OCR when text extraction fails.
☆62Oct 6, 2017Updated 8 years ago
statedecoded / law-identifier
View on GitHub
A collection of regular expressions to identify references to state laws.
☆19Sep 28, 2015Updated 10 years ago
cianjinks / Voxelio
View on GitHub
A Voxel Editor in C++
☆11Oct 27, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pudo / typecast
View on GitHub
Simple type converters: make ints, floats, bools and dates from your strings!
☆11Jul 23, 2016Updated 10 years ago
spidezad / google_search_module
View on GitHub
Retrieve google results using python
☆27Jul 18, 2014Updated 12 years ago
miguelpaz / normalista
View on GitHub
A Jekyll course template for teachers who like to write markdown, host in Github pages and hate worrying about servers
☆24Nov 21, 2017Updated 8 years ago
fly-apps / hello_elixir_sqlite
View on GitHub
An example for building and deploying an Elixir application to Fly using a Dockerfile and SQLite!
☆10Jul 19, 2022Updated 4 years ago
masaun / NFT-badge-for-staking
View on GitHub
NFT Badge for staking on Polygon. This smart contract give a staker a NFT that represents staking period (= vesting period ) which a stak…
☆11May 31, 2021Updated 5 years ago
mossmann / stealthlock
View on GitHub
stuff from my ToorCon 2015 talk
☆14Oct 27, 2015Updated 10 years ago
illagrenan / django-make-app
View on GitHub
Define models and fields using YAML and generate app for Django with views, forms, templates etc.
☆13Jan 6, 2018Updated 8 years ago
fmartingr / iosfu
View on GitHub
iOS forensics utility
☆13May 8, 2018Updated 8 years ago
MRGEffitas / Write-into-screen
View on GitHub
☆21Aug 7, 2014Updated 11 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
prashantnirgun / express-mysql-passport-jwt-api
View on GitHub
Create a API server with mysql table crud without ORM, Passport, JWT
☆12Feb 13, 2026Updated 5 months ago
EricSchles / awesome_stuff
View on GitHub
☆28Jul 12, 2018Updated 8 years ago
jabbalaci / screenshot.py
View on GitHub
Taking a screenshot of a webpage.
☆48Nov 28, 2015Updated 10 years ago
edent / Open-Source-Shakespeare
View on GitHub
A MySQL database of the complete works of William Shakespeare
☆33Aug 25, 2025Updated 10 months ago
s0lst1c3 / keyboardsnitch
View on GitHub
☆10May 8, 2016Updated 10 years ago
masaun / tranche-lending-and-borrowing-for-agriculture-market
View on GitHub
This is the smart contract that provide fixed-rate borrowing for farmers and lending for investors by utilizing a bond tranche scheme. 👩…
☆11Dec 15, 2021Updated 4 years ago
techiediaries / invoice-electron-angularjs-app
View on GitHub
An invoice desktop app built using Electron and Angularjs.
☆14Jan 8, 2018Updated 8 years ago
srri-zz / OpenRelay
View on GitHub
Peer based web hosting
☆16Sep 18, 2015Updated 10 years ago
ccastillop / iOS-Corrupted-Backup-Reader
View on GitHub
Recovering data from the iPhone Ipod Touch corrupted backups
☆12Oct 29, 2016Updated 9 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
daomaker / dao-staking
View on GitHub
DAO Maker staking (venture yield)
☆12Dec 13, 2023Updated 2 years ago
k3170makan / GooDork3-dev
View on GitHub
The shards of code that will soon become the next GooDork version
☆17Mar 6, 2013Updated 13 years ago
ttntm / recept0r
View on GitHub
A simple recipes app based on vue.js and backed by Fauna DB
☆12Aug 29, 2021Updated 4 years ago
ms-dev-1 / uni-sdr-link
View on GitHub
A small application to allow Unitrunker to control SDR Console VFO frequencies
☆15Jul 19, 2015Updated 11 years ago
eddietejeda / administrate-field-json
View on GitHub
A plugin to show and edit JSON objects within Administrate.
☆12Feb 10, 2022Updated 4 years ago
OrderAndCh4oS / tezos-music-player-next
View on GitHub
☆13May 22, 2023Updated 3 years ago
stars-labs / metatx-Java-demo
View on GitHub
Metatx Java demo
☆11Feb 18, 2021Updated 5 years ago
CoastalResilienceNetwork / GeositeFramework
View on GitHub
Mapping Framework powering TNC Coastal Resilience programs
☆13Mar 20, 2021Updated 5 years ago
evgenykuzyakov / nft-mints
View on GitHub
NFT Mints in realtime on NEAR blockchain using Events stream
☆12Oct 15, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ninjality / convert-csv-markdown
View on GitHub
Convert CSV to Markdown files.
☆13Apr 10, 2018Updated 8 years ago
demantz / flex_hackrf
View on GitHub
A scanning FLEX decoder for the HackRF based on gr-pager
☆19Sep 29, 2014Updated 11 years ago
skvamme / HTML-vector-graphics
View on GitHub
Convert a DXF vector graphics file to HTML 5 <canvas> and/or <svg> drawing primitives
☆17Mar 20, 2021Updated 5 years ago
dirkcgrunwald / hackrf-power
View on GitHub
a version of rtlsdr_power for hackrf, with gps logging
☆15Aug 28, 2015Updated 10 years ago
cuker / django-photoprocessor
View on GitHub
Automated image processing for Django
☆20Aug 29, 2019Updated 6 years ago
pombredanne / SimSearch
View on GitHub
Implementation of Bayesian Sets for fast similarity searches.
☆14Oct 2, 2011Updated 14 years ago
fairdataihub / fairdataihub.org
View on GitHub
Website of the FAIR Data Innovations Hub
☆12Updated this week