tokgolich / doctotext
Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
☆132 Updated 8 years ago
Alternatives and similar repositories for doctotext
Users who are interested in doctotext are comparing it to the libraries listed below.
- DocWire SDK: Award-winning modern data processing in C++20. SourceForge Community Choice & Microsoft support. AI-driven processing. Suppo… ☆94 Updated 3 weeks ago
- wv is a library which allows access to Microsoft Word files. It can load and parse Word 2000, 97, 95 and 6 file formats. (These are the f… ☆17 Updated 7 years ago
- File parsing: doctotext source code, version 4.0-20140202 ☆14 Updated 8 years ago
- c++ library wrapper of 7zip ☆137 Updated 2 years ago
- C++/C library to construct Excel .xls files in code. Official git repo. ☆47 Updated 5 years ago
- libchardet - Mozilla's Universal Charset Detector C/C++ API ☆112 Updated 4 years ago
- The first open-source C++ development library for OFD. ☆68 Updated 2 years ago
- Some much needed maintenance of http://silvercoders.com/en/products/doctotext/ ☆28 Updated 3 years ago
- mimetic is a free, MIT licensed, Email library (MIME) written in C++ designed to be easy to use and integrate but yet fast and efficient. ☆45 Updated last year
- A light-weight C++ HTML processing library based on pugixml ☆21 Updated 13 years ago
- NetEase Yunxin cross-platform C++ development framework ☆59 Updated 2 years ago
- A wrap of Zlib, making the use of ZLib easier in ZIP compressing/extracting. ☆45 Updated last year
- GNU character conversion library ☆148 Updated last year
- Cross platform C/C++ library with C#, Java, Python, Progress 4GL wrappers and command line tools for generating Microsoft Word .DOCX (Ope… ☆170 Updated 8 years ago
- daemon service on windows ☆31 Updated 10 years ago
- A simple and easy-to-use cross-platform C++ wrapper for the SQLite API. Fork of the CppSQLite project, originally by Rob Groves, currentl… ☆158 Updated 5 months ago
- C/Python library to extract text from MS doc files ☆11 Updated 3 years ago
- ☆72 Updated 6 years ago
- a simple non-validating css1 and html parser written in C++ tuned for iOS and Mac ☆94 Updated 5 years ago
- A miniblink-based browser built with duilib ☆104 Updated 3 years ago
- This is not the poppler repository. Please see https://poppler.freedesktop.org/ ☆52 Updated 15 years ago
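- Copied from the official duilib ( https://github.com/duilib/duilib ) for personal study only. It can be opened with VS2008, adds some demos for learning and testing, and will sync later changes from the official code. QQ group: 594312689. Discussion is welcome. ☆125 Updated 2 months ago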
- cximage 7.0.1 mirror ☆81 Updated 13 years ago
- Personally maintained version of the DuiLib project; returns to std::string, uses C++20, and supports control binding ☆101 Updated last year
- A cross platform Http client with libcurl ☆53 Updated 7 years ago
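- Borrows a SevenZip wrapper found online and simplifies the 7z compression/extraction code; supports only a few common formats (cab, zip, 7z). The generated DLL is 400 KB+. ☆22 Updated 8 years ago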
- vlc-2.2.6 compile with VS2015 ☆35 Updated 8 years ago
- from chromium ☆130 Updated last year
- A WTL GUI designer ☆58 Updated 6 years ago
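- Helper functions for TUN/TAP virtual network adapters on Windows, supporting operations such as creating a virtual adapter, setting its IP, adding routes, and reading and writing data ☆14 Updated 4 years ago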