tokgolich / doctotext

Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP), OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE), ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text. Extracts metadata and annotations.
130 · Updated 7 years ago

Alternatives and similar repositories for doctotext:

Users who are interested in doctotext are comparing it to the libraries listed below.