Tools for running OCR against files stored in S3
☆120Aug 10, 2022Updated 3 years ago
Alternatives and similar repositories for s3-ocr
Users that are interested in s3-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Jun 23, 2026Updated last week
- this is the code that goes along with the AJC story at https://www.ajc.com/news/state--regional-govt--politics/precinct-closures-harm-vot…☆13Dec 13, 2019Updated 6 years ago
- A tool for telling stories with maps.☆29May 9, 2026Updated last month
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆26Dec 15, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A handbook of best practices and case studies for modern collaborative journalism☆13Sep 16, 2017Updated 8 years ago
- Demonstration project for building out a data news rig.☆10Mar 15, 2022Updated 4 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12May 24, 2023Updated 3 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- HOCR Specification Python Parser☆12Sep 23, 2015Updated 10 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- Course materials for SMPA3193, Building Systems for Reporting☆29Apr 25, 2017Updated 9 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆13Sep 15, 2023Updated 2 years ago
- Mapping the growth of Wal-Mart in urban areas.☆14Apr 1, 2015Updated 11 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A work-in-progress SQLite extension for geospatial data☆32Apr 8, 2023Updated 3 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- Create a IIIF-enabled website using universalviewer.io and host it for free on Github Pages☆11May 7, 2019Updated 7 years ago
- Core functions and components for RecogitoJS and Annotorious☆16Nov 9, 2023Updated 2 years ago
- A Flask app to document and test Slack's interactive messages.☆10Mar 19, 2021Updated 5 years ago
- A standalone React/Redux web application for for presenting unique printed books and manuscripts in digital facsimile.☆31Mar 10, 2023Updated 3 years ago
- Like copytext, but for docs☆12Jun 12, 2018Updated 8 years ago
- A CartoDB client for PHP☆33May 27, 2015Updated 11 years ago
- Martin's HTTP package☆14Apr 23, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- A tool for creating credentials for accessing S3 buckets☆256Dec 21, 2025Updated 6 months ago
- Repository hosting the common code for the entity-fishing clients☆10May 18, 2026Updated last month
- The Datasette macOS application☆135Aug 27, 2024Updated last year
- GitHub template repository for creating new Python Click CLI tools, using the simonw/click-app cookiecutter template☆37May 12, 2024Updated 2 years ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- A custom template for initializing a new Django project the Data Desk way.☆12Feb 18, 2017Updated 9 years ago
- A modern Python library for writing maintainable web scrapers.☆250Nov 22, 2025Updated 7 months ago
- A self-contained example site for django-boundaryservice.☆38May 26, 2011Updated 15 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆15May 22, 2023Updated 3 years ago
- Tools for analyzing Git history using SQLite☆225Dec 21, 2025Updated 6 months ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Feb 2, 2024Updated 2 years ago
- Use DocumentCloud to publish PDFs for humans.☆11Oct 23, 2013Updated 12 years ago
- Interactive, IIIF powered audio/video media player React components library. Storybook Docs: https://samvera-labs.github.io/ramp/☆37Jun 22, 2026Updated last week
- Data Visualizations for the #30DayChartChallenge☆11Apr 15, 2024Updated 2 years ago
- An easily forkable commonplace book of interesting, VERY well-annoted code in any language. Take it and add your own.☆41Jan 28, 2014Updated 12 years ago