allenai / pacer-docket-parser

☆9

Alternatives and similar repositories for pacer-docket-parser:

Users who are interested in pacer-docket-parser are comparing it to the libraries listed below.

applicaai / CCpdf
Index of URLs to pdf files all over the internet and scripts
☆23 Updated last year
DS3Lab / DocParser
☆79 Updated 3 years ago
allenai / mmda
multimodal document analysis
☆164 Updated 10 months ago
allenai / smashed
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…
☆33 Updated 11 months ago
project-deepform / deepform
Experimental form data extraction for journalism
☆77 Updated 4 years ago
applicaai / kleister-charity
☆39 Updated 3 years ago
IBM / model-recycling
Ranking of fine-tuned HF models as base models.
☆35 Updated last year
darrow-labs / LegalLens
☆8 Updated 9 months ago
Layout-Parser / annotation-service
☆16 Updated 3 years ago
robinvanschaik / interpret-flair
A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.
☆27 Updated 3 years ago
applicaai / kleister-nda
☆57 Updated 3 years ago
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆42 Updated last year
allenai / vila
Incorporating VIsual LAyout Structures for Scientific Text Classification
☆175 Updated 2 years ago
jmzhao / pbos
☆19 Updated 4 years ago
megagonlabs / ruler
Data Programming by Demonstration (DPBD) for Document Classification
☆35 Updated 3 years ago
OpenBioLink / ITO
Intelligence Task Ontology (ITO)
☆73 Updated 2 years ago
drgriffis / text-essence
Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus
☆14 Updated last year
allenai / SPECTER2
☆87 Updated 11 months ago
MaxHalford / orc
🧌 Parsing structured information from OCR outputs
☆19 Updated last year
KGCP / MEL-TNNT
Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)
☆23 Updated 2 years ago
practicalweaksupervisionbook / companion
☆24 Updated 3 years ago
revuel / PatternOmatic
Finds linguistic patterns effortlessly
☆36 Updated last year
allenai / scirepeval
SciRepEval benchmark training and evaluation scripts
☆74 Updated 11 months ago
malteos / aspect-document-similarity
Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020
☆62 Updated 11 months ago
Knowledgator / utca
Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…
☆30 Updated 2 weeks ago
jarobyte91 / post_ocr_correction
Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"
☆36 Updated last year
philschmid / optimum-static-quantization
☆28 Updated last year
argilla-io / adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
☆58 Updated last year
wjbmattingly / LeetTopic
☆54 Updated last year
due-benchmark / baselines
The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."
☆36 Updated 2 years ago