jaepil / pdfminer3k
Python 3 port of pdfminer
☆186Updated 6 years ago
Alternatives and similar repositories for pdfminer3k
Users that are interested in pdfminer3k are comparing it to the libraries listed below
- Python wrapper for the ZXing barcode library☆274Updated 3 years ago
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆927Updated 7 years ago
- Constants used in Chinese text processing☆372Updated 6 months ago
- An extendable docx file format parser and converter☆192Updated last month
- Run JavaScript code from Python (EOL: https://gist.github.com/doloopwhile/8c6ec7dd4703e8a44e559411cb2ea221)☆715Updated 5 years ago
- [Translation] Natural Language Processing with Python, second edition in Chinese☆63Updated 7 years ago
- Graphic Verification Code☆16Updated 2 years ago
- Scrapy extension to control spiders using JSON-RPC☆300Updated 5 years ago
- Deprecated. Use PyEcharts instead. https://github.com/pyecharts/pyecharts☆416Updated 3 years ago
- General-purpose web page main text (and image) extraction based on the line-block distribution function - Python version☆115Updated 8 years ago
- The Python implementation for looking up Chinese administrative divisions.☆129Updated 5 years ago
- Translate Chinese hanzi to pinyin (拼音) in Python☆829Updated 3 weeks ago
- Automatically exported from code.google.com/p/pyh with Chinese docs☆71Updated 3 years ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files.☆1,072Updated 9 years ago
- Create, read, and modify Excel .xlsx files☆111Updated 4 years ago
- The simplest way to extract text from PDFs in Python☆428Updated 2 years ago
- GtWeb Python Sdk☆83Updated 8 years ago
- Python module for JSON data encoding, including jsonlint. See the project Wiki here on Github. Also read the README at the bottom of th…☆303Updated 5 years ago
- Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.☆119Updated 3 years ago
- Compiled PyV8 for Mac OS X☆103Updated 12 years ago
- The upload website script built on python Flask with jQuery File Upload☆61Updated 8 years ago
- Insert HTML or Markdown into a Word document☆85Updated 4 years ago
- A Chinese word segmenter based on CRF☆233Updated 6 years ago
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆45Updated 11 years ago
- universal character encoding detector☆404Updated last month
- Hanzi Converter for Traditional and Simplified Chinese☆188Updated 5 years ago
- CSS Selectors for Python☆298Updated last month
- A utility to read and write PDFs with Python☆334Updated 3 years ago
- PhantomJS Downloader for Scrapy, Yeah!☆94Updated 10 years ago
- Some useful utility functions☆77Updated 2 years ago