tokgolich / doctotext
Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
☆127Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for doctotext
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo…☆66Updated 3 weeks ago
- Some much needed maintenance of http://silvercoders.com/en/products/doctotext/☆27Updated 2 years ago
- The first open-source C++ development library for OFD.☆59Updated 11 months ago
- 文件解析 doctotext 源码 4.0-20140202 版本☆14Updated 7 years ago
- wv is a library which allows access to Microsoft Word files. It can load and parse Word 2000, 97, 95 and 6 file formats. (These are the f…☆14Updated 6 years ago
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆55Updated 9 years ago
- A wrap of Zlib, making the use of ZLib easier in ZIP compressing/extracting.☆44Updated 3 months ago
- 网易云信跨平台C++开发框架☆57Updated last year
- 一个很简陋的ofdEditor☆39Updated 5 years ago
- ☆51Updated 3 years ago
- C/Python library to extract text from MS doc files☆11Updated 2 years ago
- a simple non-validating css1 and html parser written in C++ tuned for iOS and Mac☆90Updated 4 years ago
- C++/C library to construct Excel .xls files in code. Official git repo.☆44Updated 3 years ago
- CEF浏览器☆64Updated 4 years ago
- CEF3 + Qt5, VS 项目。启用Flash。类讲解。Clone from https://github.com/tishion/QCefView.☆20Updated 6 years ago
- Fast and easy to use C++ library that creates or modifies Microsoft Word files without installing Word.☆89Updated 2 weeks ago
- ☆70Updated 5 years ago
- cximage 7.0.1 mirror☆80Updated 12 years ago
- This is not the poppler repository. Please see https://poppler.freedesktop.org/☆52Updated 14 years ago
- Qt/C++ library based on google pdfium☆15Updated 3 years ago
- GNU character conversion library☆125Updated last week
- libchardet - Mozilla's Universal Charset Detector C/C++ API☆105Updated 3 years ago
- window下虚拟网卡TUN/TAP的帮助函数,可实现虚拟网卡的创建、设置IP、增加路由、读取和写入数据等操作☆14Updated 3 years ago
- extract text from MS-WORD's .doc binary format file☆33Updated last year
- A common library providing clean and easy consuming of CEF☆99Updated last week
- 用duilib做的miniblink的浏览器☆99Updated 2 years ago
- A simple office file reader can extract content and summary information from .doc,.docx,.ppt,.pptx files without Microsoft Office or inte…☆70Updated 6 years ago
- A simple and easy-to-use cross-platform C++ wrapper for the SQLite API. Fork of the CppSQLite project, originally by Rob Groves, currentl…☆133Updated 6 months ago
- DuiLib with cef3 browser control☆38Updated 4 years ago
- PDFium Reader☆63Updated last year