tokgolich / doctotext
Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
☆130Updated 7 years ago
Alternatives and similar repositories for doctotext:
Users that are interested in doctotext are comparing it to the libraries listed below
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo…☆80Updated 2 weeks ago
- wv is a library which allows access to Microsoft Word files. It can load and parse Word 2000, 97, 95 and 6 file formats. (These are the f…☆17Updated 6 years ago
- Some much needed maintenance of http://silvercoders.com/en/products/doctotext/☆27Updated 2 years ago
- The first open-source C++ development library for OFD.☆61Updated last year
- 文件解析 doctotext 源码 4.0-20140202 版本☆14Updated 7 years ago
- libchardet - Mozilla's Universal Charset Detector C/C++ API☆112Updated 3 years ago
- ☆52Updated 3 years ago
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆62Updated 9 years ago
- A light-weight C++ HTML processing library based on pugixml☆19Updated 12 years ago
- PDFium Reader☆66Updated last year
- Qt/C++ library based on google pdfium☆15Updated 4 years ago
- A common library providing clean and easy consuming of CEF☆114Updated last month
- c++ library wrapper of 7zip☆134Updated 2 years ago
- a simple non-validating css1 and html parser written in C++ tuned for iOS and Mac☆90Updated 4 years ago
- 一个很简陋的ofdEditor☆39Updated 5 years ago
- Text editors made for VNoteX.☆40Updated 2 months ago
- C/Python library to extract text from MS doc files☆11Updated 2 years ago
- extract text from MS-WORD's .doc binary format file☆34Updated 2 years ago
- 用duilib做的miniblink的浏览器☆100Updated 3 years ago
- Platform independent c++ library providing ftp client functionality.☆296Updated 7 months ago
- cli for extracting text from PDF files (and maybe possibly tables)☆78Updated 3 weeks ago
- A wrap of Zlib, making the use of ZLib easier in ZIP compressing/extracting.☆45Updated 8 months ago
- Qt5 interface of the popular PDF library MuPDF☆116Updated 10 years ago
- ☆71Updated 5 years ago
- A c++ epub reading library☆17Updated 10 years ago
- 网易云信跨平台C++开发框架☆58Updated last year
- Modern C++20 library for creating Microsoft Word Document (.docx file).☆124Updated last month
- A simple and easy-to-use cross-platform C++ wrapper for the SQLite API. Fork of the CppSQLite project, originally by Rob Groves, currentl…☆148Updated 11 months ago
- window下虚拟网卡TUN/TAP的帮助函数,可实现虚拟网卡的创建、设置IP、增加路由、读取和写入数据等操作☆14Updated 4 years ago
- cximage 7.0.1 mirror☆81Updated 13 years ago