zanibbi / SymbolScraper
Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.
☆19Updated 3 years ago
Alternatives and similar repositories for SymbolScraper:
Users that are interested in SymbolScraper are comparing it to the libraries listed below
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF…☆28Updated 3 years ago
- ☆91Updated 2 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- Workshop Home Page for Benchmarking: Past, Present and Future☆35Updated 3 years ago
- Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"☆53Updated 2 years ago
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆107Updated 2 years ago
- Hyperparameter Search for AllenNLP☆137Updated last week
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 3 years ago
- Train transformer-based models.☆28Updated 3 weeks ago
- A reference implementation of algorithms for distributions over spanning trees.☆21Updated 5 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 7 months ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 3 months ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 10 months ago
- Tools to bulk download arxiv data☆121Updated 6 years ago
- ☆14Updated 2 years ago
- Repository for the ACL 2020 virtual conference website (work in progress)☆38Updated 2 years ago
- Framework for information extraction from tables☆41Updated 5 years ago
- Data and code for SemEval 2019, Task 10: Math Question Answering☆47Updated 6 years ago
- Given a pair of sentences (premise, hypothesis), the decomposed graph entailment model (DGEM) predicts whether the premise can be used to…☆52Updated 4 years ago
- ☆28Updated 2 months ago
- ☆38Updated 3 years ago
- ☆57Updated 3 years ago
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin…☆21Updated 5 years ago