klokoy / pdf2htmlEX_dockerLinks
Automated docker build for pdf2htmlEX
☆22Updated 9 years ago
Alternatives and similar repositories for pdf2htmlEX_docker
Users that are interested in pdf2htmlEX_docker are comparing it to the libraries listed below
Sorting:
- Scrapy extension to control spiders using JSON-RPC☆300Updated 6 years ago
- A scrapy pipeline which send items to Elastic Search server☆322Updated 3 years ago
- MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item…☆358Updated 4 years ago
- Scrapy Middleware to set a random User-Agent for every Request.☆202Updated 6 years ago
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL.☆118Updated 12 years ago
- Code for example in this post: http://brunorocha.org/python/microservices-with-python-rabbitmq-and-nameko.html☆190Updated 9 years ago
- ☆104Updated 9 years ago
- ☆143Updated 10 years ago
- OAuth2 for Chinese social sites☆318Updated 9 years ago
- This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica☆259Updated 5 years ago
- MongoDB extensions for Scrapy☆44Updated 11 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls☆275Updated 11 months ago
- A fully functional REST Web API. Powered by Eve.☆256Updated 7 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆929Updated 7 years ago
- Python script to automate document conversions using LibreOffice/OpenOffice.org☆360Updated 4 years ago
- Django testrunner for mongoengine☆46Updated 14 years ago
- DJANGO + FAbric + GUnicorn + NGInx + Supervisor deployment☆102Updated 5 years ago
- Access Control List for Tornado (or just plain Python)☆29Updated 13 years ago
- Mongodb support for scrapy☆101Updated 8 years ago
- A RabbitMQ Scheduler for Scrapy☆87Updated 3 years ago
- This is the code from my Django and Socket.IO realtime tutorial which you can find at http://maxburstein.com/blog/realtime-django-using-n…☆132Updated 12 years ago
- Monitors prices of Amazon products via Product Advertising API☆156Updated 6 years ago
- Whoosh indexing capabilities for Flask-SQLAlchemy☆283Updated 2 years ago
- ☆167Updated 7 years ago
- Scrapy examples crawling Craigslist☆201Updated 9 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆160Updated last week
- A blog app with django-rest-framework and angularjs.☆160Updated 10 years ago
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word.☆278Updated last year
- Stream-Framework demonstration app☆139Updated 7 years ago
- A simple (Python) query builder for Elasticsearch☆80Updated 4 years ago