gregjurman / tesserwrap
Python bindings to the Tesseract API
☆66 · Updated 8 years ago
Related projects:
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C). ☆24 · Updated 10 years ago
- Modularly extensible semantic metadata validator ☆83 · Updated 8 years ago
- Utility library to turn country names into ISO two-letter codes ☆65 · Updated 10 months ago
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets. ☆11 · Updated 9 years ago
- ARCHIVED: A Python API for Tesseract ☆20 · Updated 7 years ago
- Python bindings for CLD2. ☆17 · Updated 6 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 · Updated 9 years ago
- Commit Counter Chart is a Python Flask app to view git history using D3.js ☆38 · Updated 8 years ago
- Experiments mining image collections using OpenCV ☆64 · Updated 9 years ago
- Data analysis tool. ☆84 · Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 3 years ago
- Code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Updated 7 years ago
- ☆24 · Updated this week
- This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by … ☆28 · Updated 11 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆73 · Updated 3 months ago
- RGP -- Redis Graph via Python ☆30 · Updated 9 years ago
- Interest is an event-driven web framework on top of aiohttp/asyncio. ☆16 · Updated 3 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending ☆57 · Updated 2 years ago
- Telegraphy provides real-time events for WSGI Python applications ☆202 · Updated 9 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions. ☆28 · Updated 8 years ago
- csvcat ☆22 · Updated 8 years ago
- Feedbuffer buffers RSS and Atom syndication feeds, that is to say it caches new feed entries until the news aggregator requests them and … ☆19 · Updated 8 years ago
- A library for extracting tables from PDF files ☆90 · Updated 10 years ago
- Faster replacement for Python's urlparse module ☆46 · Updated 5 years ago
- Simple type converters: make ints, floats, bools and dates from your strings! ☆10 · Updated 8 years ago
- Internal Stack Exchange ☆26 · Updated 9 years ago
- A skip dict is a Python dictionary which is permanently sorted by value. ☆19 · Updated 9 years ago
- Sometimes you just need a lot of text. Plainstream is a small Python app that provides you with a plain text stream directly from Wikiped… ☆24 · Updated 11 months ago
- Python library and command line tool for converting data from one format to another ☆100 · Updated 4 years ago
- Simple HTTP cache for Python Requests ☆99 · Updated 8 years ago