zanibbi / SymbolScraperLinks
Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.
☆19Updated 3 years ago
Alternatives and similar repositories for SymbolScraper
Users that are interested in SymbolScraper are comparing it to the libraries listed below
Sorting:
- ☆93Updated 3 years ago
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖☆142Updated 3 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- LongSumm - Scientific Document Summarization Task☆74Updated 3 years ago
- Direct Attentive Dependency Parser☆54Updated last year
- Converter from UD-trees to BART representation☆36Updated last year
- Extracting scientific claims from biomedical abstracts (powered by AllenNLP)☆144Updated 4 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆63Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆38Updated 3 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF…☆29Updated 3 years ago
- Implementation for EACL 2021 paper "Scientific Discourse Tagging for Evidence Extraction".☆20Updated 3 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆76Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122☆135Updated last year
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆108Updated 3 years ago
- Workshop Home Page for Benchmarking: Past, Present and Future☆35Updated 3 years ago
- ☆17Updated 2 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆118Updated 4 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆71Updated 2 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆58Updated 2 weeks ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆31Updated last year
- Dataset accompanying the SPECTER model☆138Updated 2 years ago
- An open information extraction system that provides compact extractions☆93Updated 3 years ago
- NaturalProofs: Mathematical Theorem Proving in Natural Language (NeurIPS 2021 Datasets & Benchmarks)☆133Updated 2 years ago
- ☆40Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- Statistics on multilingual datasets☆17Updated 3 years ago