tokgolich / doctotext
Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
☆129Updated 7 years ago
Alternatives and similar repositories for doctotext:
Users that are interested in doctotext are comparing it to the libraries listed below
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo…☆79Updated this week
- Some much needed maintenance of http://silvercoders.com/en/products/doctotext/☆27Updated 2 years ago
- wv is a library which allows access to Microsoft Word files. It can load and parse Word 2000, 97, 95 and 6 file formats. (These are the f…☆16Updated 6 years ago
- 文件解析 doctotext 源码 4.0-20140202 版本☆14Updated 7 years ago
- C/Python library to extract text from MS doc files☆11Updated 2 years ago
- The first open-source C++ development library for OFD.☆61Updated last year
- ☆52Updated 3 years ago
- extract text from MS-WORD's .doc binary format file☆34Updated last year
- C++/C library to construct Excel .xls files in code. Official git repo.☆45Updated 4 years ago
- Modern C++20 library for creating Microsoft Word Document (.docx file).☆118Updated last week
- c++ library wrapper of 7zip☆134Updated 2 years ago
- A light-weight C++ HTML processing library based on pugixml☆19Updated 12 years ago
- libchardet - Mozilla's Universal Charset Detector C/C++ API☆111Updated 3 years ago
- PDFium Reader☆66Updated last year
- Cross platform C/C++ library with C#, Java, Python, Progress 4GL wrappers and command line tools for generating Microsoft Word .DOCX (Ope…☆167Updated 8 years ago
- Duilib扩展控件收集☆35Updated 6 years ago
- A simple office file reader can extract content and summary information from .doc,.docx,.ppt,.pptx files without Microsoft Office or inte…☆72Updated 6 years ago
- CEF3 + Qt5, VS 项目。启用Flash。类讲解。Clone from https://github.com/tishion/QCefView.☆20Updated 6 years ago
- This is not the poppler repository. Please see https://poppler.freedesktop.org/☆52Updated 15 years ago
- A wrap of Zlib, making the use of ZLib easier in ZIP compressing/extracting.☆45Updated 6 months ago
- DuiLib with cef3 browser control☆37Updated 4 years ago
- Google PDFium without forms, js, ... just for readers☆21Updated 3 years ago
- 用duilib做的miniblink的浏览器☆100Updated 3 years ago
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆55Updated 9 years ago
- GNU character conversion library☆132Updated 4 months ago
- wxWidgets components to display PDF content with the PDFium library☆94Updated 4 years ago
- 网易云信跨平台C++开发框架☆58Updated last year
- http://www.pdflib.com/download/free-software/pdflib-lite-7/☆32Updated 9 years ago
- A C++11 header-only library for Microsoft Exchange Web Services☆71Updated 3 months ago
- a simple non-validating css1 and html parser written in C++ tuned for iOS and Mac☆91Updated 4 years ago