documentcloud / pdfshaverLinks
Shave pages off of PDFs as images
☆59Updated 7 years ago
Alternatives and similar repositories for pdfshaver
Users that are interested in pdfshaver are comparing it to the libraries listed below
Sorting:
- ☆175Updated 8 years ago
- Ruby script to parse CSV file(s) into a database.☆53Updated 11 years ago
- an approval process automation tool☆107Updated 6 years ago
- A Ruby client for interacting with ProPublica Campaign Finance API☆54Updated 2 years ago
- Encyclopedia of Life☆61Updated 6 years ago
- Structured Data from PDF image-based files☆89Updated 12 years ago
- Since I originally wrote this a module called request has come on the scene. You might want to try that before mucking about with extrac…☆26Updated 9 years ago
- Exploring extracting tables from a PDF to CSV using PDF.JS☆105Updated 9 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 12 years ago
- generate rules from lists of words☆16Updated 4 years ago
- GitHub activity dashboard☆79Updated 13 years ago
- GitHub issues made awesome☆62Updated 3 years ago
- PHAROS Image Database☆38Updated 7 years ago
- Analysis of Github Commits Comments☆36Updated 8 years ago
- trying shingling / resemblance / simhash / sketching to do some data deduping☆97Updated 10 years ago
- Check that your CSV files are valid☆74Updated last year
- A rubular.com clone for javascript regular expressions.☆137Updated 5 years ago
- This is a helper function that utilises d3.js and Crossfilter to create interdependent interactive histograms.☆60Updated 12 years ago
- Gives you binaries like mysql2csv, mysql2json, and mysql2xml, and Ruby classes to match.☆85Updated 10 years ago
- A data scraping framework based on Open Civic Data's Pupa☆67Updated 5 years ago
- [DEPRECATED] An open API server, data import tools, and sample apps to help small businesses search for opportunities to work with the U.…☆102Updated 7 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- A node.js library for extracting data from scanned forms.☆117Updated 2 years ago
- A tool for editing CSV & JSON files from your computer & from GitHub.☆48Updated 8 years ago
- Break Apart Documents into Images, Text, Pages and PDFs☆834Updated last year
- An open website for opening Congress.☆48Updated 9 years ago
- Split email messages into an object stream☆23Updated 2 months ago
- Automated, headless browser testing (using PhantomJS).☆99Updated 5 years ago
- BDD-style acceptance test framework for web applications based on PhantomJS.☆168Updated 4 years ago
- The frontend service for GOV.UK Verify☆20Updated last year