rufuspollock-okfn / labs-opd
☆124 Updated 3 years ago
Alternatives and similar repositories for labs-opd:
Users that are interested in labs-opd are comparing it to the libraries listed below
- Matches a product described in any kind of text data to a category of Google's Taxonomy ☆61 Updated 6 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 Updated last year
- A lightweight, standardized library accessing files and datasets, especially tabular ones (CSV, Excel). ☆73 Updated 2 years ago
- Python interface to Apache PDFBox command-line tools. ☆75 Updated 2 years ago
- A simple PDF transcription project for PyBossa ☆19 Updated 9 years ago
- Working with hOCR in JavaScript ☆127 Updated 2 years ago
- Silently print from within a browser using JavaScript and PrintNode remote printing service. ☆70 Updated 2 years ago
- TSheets API Documentation ☆12 Updated last year
- All USA zip codes split to separate kml files ☆45 Updated 4 years ago
- Amadeus for Developers APIs Testing Data Collection ☆25 Updated 2 years ago
- JSON API data provider for react-admin. ☆73 Updated 2 years ago
- Python library for extracting HTML microdata ☆166 Updated last year
- A simple OpenRefine reconciliation service that runs on top of a CSV file ☆120 Updated 9 years ago
- Name search for people and entities on the EU, OFAC and UN sanctions lists ☆25 Updated 4 years ago
- Java libraries to read and write real estate data in common formats (e.g. OpenImmo, ImmoXML, Kyero, Trovit, IDX) ☆52 Updated last year
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ☆105 Updated 2 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆150 Updated 3 months ago
- Automatically extracts structured information from webpages ☆108 Updated 2 years ago
- Extract receipt info ☆44 Updated 2 years ago
- Data validation as a service. Project retired; go to the current one at frictionsless/repository ☆69 Updated 2 years ago
- HOCR Specification Python Parser ☆13 Updated 9 years ago
- Starter workflow for creating electronically signed PDF agreements. ☆138 Updated 2 years ago
- Adapting the python library OCRmyPDF to run in an AWS Lambda Function ☆17 Updated 2 years ago
- Using ML to extract campaign finance data from messy forms for journalism ☆76 Updated 2 years ago
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner) ☆20 Updated 7 years ago
- A search engine for Open Data ☆53 Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆123 Updated last year
- Legal Entity Name Understanding ☆19 Updated 2 weeks ago
- A case management app built with Lowdefy. ☆32 Updated last year
- Activity timer powerup for Trello ☆49 Updated 6 months ago