johnlinp / pdf-to-markdown
Convert PDF files into markdown files
☆287Updated 4 years ago
Alternatives and similar repositories for pdf-to-markdown:
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
- Personal clone of Poppler, official repository is here: https://gitlab.freedesktop.org/poppler/poppler☆130Updated 6 years ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆50Updated 6 years ago
- A python module for writing pandoc filters, with a collection of examples☆531Updated 7 months ago
- This tool help you translate xmind file to markdown syntax. In future, it may support markdown syntax to xmind.☆70Updated last year
- Markdown rendering + Latex extras (equations, tables, ...), with conversion features, for the scientific community☆564Updated this week
- Pandoc filters that use Panflute☆59Updated 4 years ago
- Add Math to your Markdown with a KaTeX plugin for Markdown-it☆264Updated 9 months ago
- This is a Chrome Extension used to copy the element in current page as markdown format.☆54Updated 8 years ago
- Templates for pandoc, tagged to release☆481Updated last week
- Extract tables from scanned image PDFs using Optical Character Recognition.☆271Updated 4 years ago
- ☆181Updated 6 years ago
- Visualize markdown files as mindmaps in Atom editor☆117Updated 5 years ago
- Tool to convert a PDF file (myfile.pdf) to a fixed layout ePub file (myfile.epub). The layout is perfectly retained and all the fonts are…☆214Updated 3 years ago
- PDFbeads updated to run with Ruby 2.0☆23Updated 11 years ago
- pdf2SVG for windows (using poppler and cairo)☆185Updated 6 years ago
- Clarity theme for Gitbook designed for The Zen Approach☆74Updated 10 years ago
- Preprocessor for Markdown files to generate a table of contents and other documentation needs☆311Updated 3 years ago
- Python script to do PDF OCR conversion using Tesseract☆373Updated last year
- Work in progress for "Mastering Zotero"☆102Updated 6 years ago
- Default theme for GitBook☆194Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs☆271Updated 6 months ago
- 🗒️ A style sheet for Markdown☆260Updated 3 years ago
- python based software to unpack kindlegen generated ebooks☆61Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆382Updated 6 months ago
- Stylesheets for Markdown to HTML conversion☆98Updated 5 years ago
- Linguistic Annotation and Visualization Tool for PDF Documents☆200Updated 5 years ago
- Some templates for Pandoc.☆720Updated last year
- PDF to XML ALTO file converter☆224Updated this week
- Demos, examples and utilities using PyMuPDF☆626Updated 7 months ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆145Updated last year