kspeeckaert / pyPdfCompareLinks
Visual, page-by-page comparison of two PDF files
☆21Updated 11 years ago
Alternatives and similar repositories for pyPdfCompare
Users that are interested in pyPdfCompare are comparing it to the libraries listed below
Sorting:
- API client for fetching and comparing passages from legislation☆11Updated 4 months ago
- Plugin for LLM adding support for Google's PaLM 2 model☆14Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- UNSUPPORTED Pasteboard - Python interface for NSPasteboard (macOS clipboard)☆35Updated 9 months ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- Apple event bridge for Python 3 (minimally maintained)☆61Updated 7 months ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- micro-library to produce a couple of basic, attractive, printable plots with matplotlib☆11Updated 7 years ago
- Split a JSON file with hierarchical data to multiple CSV files☆28Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- run applescript☆70Updated 4 years ago
- Hybrid architecture media server, media service and Streamlit client app using FastAPI and Python☆12Updated 2 years ago
- A conda-smithy repository for spacy.☆14Updated 2 weeks ago
- LLM plugin for embeddings using sentence-transformers☆65Updated last month
- Google App Scripts that sends a number of emails from the specific number and that tracks the open status of each email☆17Updated 5 months ago
- ☆18Updated 3 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 7 years ago
- 📖👓🏷Tag your getpocket.com articles automatically using natural language processing☆45Updated 5 years ago
- ☆30Updated 2 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz☆38Updated last year
- spaCy entry points for Curated Transformers☆31Updated last week
- Partial result caching for pandas in Python.☆19Updated 6 years ago
- Convert a Claude.ai export to SQLite☆49Updated 7 months ago
- Streamlit component for Jina neural search☆41Updated 3 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- A bibliographic reference correction service☆23Updated 2 years ago
- Search sites for RSS, Atom, and JSON feeds.☆18Updated 2 years ago
- A library for extracting tables from PDF files☆89Updated 4 years ago
- Add website scraping abilities to Datasette☆62Updated 2 years ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆30Updated 3 years ago