janosh / pdf-compressorLinks
CLI + Python API for batch compressing PDFs
☆32Updated last week
Alternatives and similar repositories for pdf-compressor
Users that are interested in pdf-compressor are comparing it to the libraries listed below
Sorting:
- Converts a single/double-column PDF formatted paper into a html page, which has the original view & the paragraph view extracted from the…☆26Updated 3 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆66Updated last year
- Common Crawl Index Server☆70Updated 7 months ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated 2 years ago
- Conversion tools from various formats to StarDict.☆33Updated this week
- Record github trending everyday.☆57Updated last year
- A markdown-supported command-line interface tool that connects to ChatGPT using OpenAI's API key.☆48Updated 2 years ago
- Extract structured data from PDF invoices☆14Updated 4 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆132Updated 2 years ago
- Yet another tool to search through your (exported) ChatGPT conversations☆12Updated last year
- ☆43Updated 9 months ago
- Type discovery for Python☆24Updated 9 years ago
- Using Selenium and DeepL to translate powerpoint files with python-pptx☆47Updated 4 years ago
- Python Script to Convert Subtitle formats from .srt to .ass☆34Updated 3 years ago
- Python command line application to convert Markdown to PDF.☆54Updated last year
- Libreoffice extension to convert image to editable document☆17Updated 8 years ago
- Easy to use and open-source unknown stealer☆22Updated 2 years ago
- Converts csv data to markdown tables☆72Updated last year
- Convert curl commands to Python code in your browser☆11Updated 8 years ago
- Fetch novels from internet☆13Updated 4 years ago
- A Javascript library to convert number and monetary amount to written text in multiple languages. Also helpful for writing cheques (check…☆10Updated 5 months ago
- Experiments with Langchain using different approaches on Google colab☆24Updated last year
- List of tools for dealing with the wonderful PDF format.☆51Updated 5 years ago
- Web Extension - markdown sticky notes☆30Updated this week
- Linux utilities to make life easier and more convenient.☆17Updated 3 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 9 years ago
- Convert PDF files into markdown files☆292Updated 5 years ago
- A desktop web application for learning Mandarin Chinese and its character stroke order.☆36Updated 2 years ago
- xlsxgrep is a CLI tool to search text in XLSX, XLS, XLSM, CSV, TSV and ODS files. It works similarly to Unix/GNU Linux grep.☆45Updated 6 months ago
- Advanced explorer of github.com☆93Updated last month