kspeeckaert / pyPdfCompare
Visual, page-by-page comparison of two PDF files
☆21 Updated 11 years ago
Alternatives and similar repositories for pyPdfCompare
Users who are interested in pyPdfCompare are comparing it to the libraries listed below.
- A natural language date parser. (Python version of chrono.js) ☆25 Updated 8 months ago
- Python port for IWNLP.Lemmatizer ☆18 Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆157 Updated 4 months ago
- pythonic interface to the courtlistener api ☆20 Updated 7 years ago
- Kelvin Legal Data OS - Public Examples ☆19 Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 6 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆27 Updated 2 months ago
- ☆20 Updated 4 years ago
- API client for fetching and comparing passages from legislation ☆14 Updated last year
- Reading legal authority for the last time ☆42 Updated 11 months ago
- A database of court reporters, tests and other experiments ☆122 Updated 2 weeks ago
- A collection of regular expressions for matching citations to state, federal, and even international law ☆40 Updated 4 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ☆86 Updated 7 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated last year
- This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC) ☆17 Updated 3 years ago
- Python based Wikidata framework for easy dataframe extraction ☆45 Updated 2 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆87 Updated last year
- Named entity recognition for the legal domain ☆43 Updated 4 years ago
- A library for extracting tables from PDF files ☆92 Updated 5 years ago
- A maximum-strength name parser for record linkage. ☆39 Updated 5 months ago
- Extract, parse and populate templates from strings ☆27 Updated 6 years ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML. ☆151 Updated 5 years ago
- Add website scraping abilities to Datasette ☆66 Updated 2 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string compari… ☆59 Updated 7 years ago
- Use ML-Annotate to label data for machine learning purposes ☆110 Updated 5 years ago
- Deployment package for LexPredict ContraxSuite ☆19 Updated 6 years ago
- PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz ☆39 Updated last year
- A visualisation tool for Spacy using Hierplane. ☆65 Updated 3 years ago
- Deutsch Language Tool Kit ☆12 Updated 10 years ago