kspeeckaert / pyPdfCompareLinks
Visual, page-by-page comparison of two PDF files
☆21Updated 11 years ago
Alternatives and similar repositories for pyPdfCompare
Users that are interested in pyPdfCompare are comparing it to the libraries listed below
Sorting:
- A natural language date parser. (Python version of chrono.js)☆25Updated 4 months ago
- Python interface to the Airtable's REST API☆274Updated 11 months ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆40Updated 4 years ago
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- ☆19Updated 4 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- Reading legal authority for the last time☆40Updated 7 months ago
- AllThePatents tooling☆10Updated last year
- Next-generation Punkt sentence boundary detection with zero dependencies☆18Updated 2 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆155Updated 3 weeks ago
- API client for fetching and comparing passages from legislation☆14Updated 8 months ago
- Detect and visualize text reuse☆118Updated last year
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Updated 4 years ago
- This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC)☆16Updated 2 years ago
- A Python library for extracting semantic information from text, such as dates and numbers.☆77Updated 3 years ago
- Wikidata authority file mapping tool☆11Updated 7 years ago
- pythonic interface to the courtlistener api☆20Updated 6 years ago
- A Python canonicalizer to disambiguate and recognize known names from a poor quality data entry list.☆20Updated 9 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- an extensible tool to generate hyperlinks from legal citations☆36Updated last year
- Get list of common stop words in various languages in Python☆156Updated last year
- Graph extraction and NLP analysis for Baleen Corpora☆18Updated 9 years ago
- Named entity recognition for the legal domain☆42Updated 4 years ago
- remove signature blocks from emails☆86Updated 6 years ago
- A financial disclosure data extraction tool.☆18Updated 2 years ago
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆30Updated 3 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated 2 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆151Updated 5 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year