Tools for running OCR against files stored in S3
☆120Aug 10, 2022Updated 3 years ago
Alternatives and similar repositories for s3-ocr
Users that are interested in s3-ocr are comparing it to the libraries listed below
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆17Mar 12, 2026Updated last week
- A tool for telling stories with maps.☆29Feb 26, 2026Updated 3 weeks ago
- Data and scripts relating to the publishing of the House expenditure reports, and hopefully the Senate's in future.☆25Dec 15, 2020Updated 5 years ago
- Demonstration project for building out a data news rig.☆10Mar 15, 2022Updated 4 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12May 24, 2023Updated 2 years ago
- Example of running Datasette on Azure Functions☆11Mar 27, 2021Updated 4 years ago
- HOCR Specification Python Parser☆12Sep 23, 2015Updated 10 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Dec 8, 2022Updated 3 years ago
- A library and command-line tool for fetching Facebook Pages' published posts.☆13Jul 18, 2017Updated 8 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆13Sep 15, 2023Updated 2 years ago
- Mapping the growth of Wal-Mart in urban areas.☆15Apr 1, 2015Updated 10 years ago
- A work-in-progress SQLite extension for geospatial data☆32Apr 8, 2023Updated 2 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- Core functions and components for RecogitoJS and Annotorious☆16Nov 9, 2023Updated 2 years ago
- A Flask app to document and test Slack's interactive messages.☆10Mar 19, 2021Updated 5 years ago
- A standalone React/Redux web application for presenting unique printed books and manuscripts in digital facsimile.☆31Mar 10, 2023Updated 3 years ago
- Like copytext, but for docs☆12Jun 12, 2018Updated 7 years ago
- A CartoDB client for PHP☆33May 27, 2015Updated 10 years ago
- A tool for creating credentials for accessing S3 buckets☆247Dec 21, 2025Updated 3 months ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 9 months ago
- The Datasette macOS application☆134Aug 27, 2024Updated last year
- GitHub template repository for creating new Python Click CLI tools, using the simonw/click-app cookiecutter template☆36May 12, 2024Updated last year
- A modern Python library for writing maintainable web scrapers.☆250Nov 22, 2025Updated 3 months ago
- Machine assisted dossiers☆19Oct 12, 2017Updated 8 years ago
- A custom template for initializing a new Django project the Data Desk way.☆12Feb 18, 2017Updated 9 years ago
- ☆14Aug 27, 2022Updated 3 years ago
- A self-contained example site for django-boundaryservice.☆38May 26, 2011Updated 14 years ago
- Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying☆15May 22, 2023Updated 2 years ago
- Tracking my progress in doing GIS/Geospatial work in Python 3.x☆12May 23, 2016Updated 9 years ago
- Tools for analyzing Git history using SQLite☆221Dec 21, 2025Updated 3 months ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆26Dec 24, 2014Updated 11 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Feb 2, 2024Updated 2 years ago
- Use DocumentCloud to publish PDFs for humans.☆11Oct 23, 2013Updated 12 years ago
- Data Visualizations for the #30DayChartChallenge☆11Apr 15, 2024Updated last year
- An easily forkable commonplace book of interesting, VERY well-annotated code in any language. Take it and add your own.☆41Jan 28, 2014Updated 12 years ago
- Interactive, IIIF powered audio/video media player React components library. Styleguidist Docs: https://samvera-labs.github.io/ramp/☆36Updated this week