tokgolich / doctotextLinks
Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
☆131Updated 7 years ago
Alternatives and similar repositories for doctotext
Users that are interested in doctotext are comparing it to the libraries listed below
Sorting:
- wv is a library which allows access to Microsoft Word files. It can load and parse Word 2000, 97, 95 and 6 file formats. (These are the f…☆17Updated 7 years ago
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo…☆88Updated last week
- Some much needed maintenance of http://silvercoders.com/en/products/doctotext/☆27Updated 2 years ago
- 文件解析 doctotext 源码 4.0-20140202 版本☆14Updated 8 years ago
- libchardet - Mozilla's Universal Charset Detector C/C++ API☆112Updated 3 years ago
- The first open-source C++ development library for OFD.☆64Updated last year
- A wrap of Zlib, making the use of ZLib easier in ZIP compressing/extracting.☆45Updated 10 months ago
- Win32 IPC library with async api☆52Updated 9 years ago
- 用duilib做的miniblink的浏览器☆103Updated 3 years ago
- c++ library wrapper of 7zip☆134Updated 2 years ago
- ☆55Updated 4 years ago
- window下虚拟网卡TUN/TAP的帮助函数,可实现虚拟网卡的创建、设置IP、增加路由、读取和写入数据等操作☆14Updated 4 years ago
- 额外维护的DuiLib库☆46Updated 10 years ago
- Cross platform C/C++ library with C#, Java, Python, Progress 4GL wrappers and command line tools for generating Microsoft Word .DOCX (Ope…☆169Updated 8 years ago
- A light-weight C++ HTML processing library based on pugixml☆20Updated 13 years ago
- extract text from MS-WORD's .doc binary format file☆34Updated 2 years ago
- C/Python library to extract text from MS doc files☆11Updated 2 years ago
- mimetic is a free, MIT licensed, Email library (MIME) written in C++ designed to be easy to use and integrate but yet fast and efficient.☆43Updated last year
- PDFium library without V8 JavaScript engine - compiles under Linux, Mac and Windows☆62Updated 9 years ago
- Modern C++20 library for creating Microsoft Word Document (.docx file).☆138Updated 3 weeks ago
- ☆72Updated 5 years ago
- A simple and easy-to-use cross-platform C++ wrapper for the SQLite API. Fork of the CppSQLite project, originally by Rob Groves, currentl…☆151Updated 3 weeks ago
- PDFium Reader☆68Updated last year
- C++/C library to construct Excel .xls files in code. Official git repo.☆46Updated 4 years ago
- ☆37Updated 5 years ago
- 一个很简陋的ofdEditor☆39Updated 5 years ago
- GNU character conversion library☆139Updated 7 months ago
- pdf2word: sdk by BCL☆9Updated 8 years ago
- cximage 7.0.1 mirror☆82Updated 13 years ago
- daemon service on windows☆31Updated 9 years ago