zanibbi / SymbolScraper
Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.
☆19 Updated 3 months ago
Alternatives and similar repositories for SymbolScraper
Users who are interested in SymbolScraper are comparing it to the libraries listed below.
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆69 Updated 5 years ago
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖 ☆143 Updated 3 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122 ☆136 Updated last year
- Extracting scientific claims from biomedical abstracts (powered by AllenNLP) ☆143 Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆179 Updated 2 years ago
- Workshop Home Page for Benchmarking: Past, Present and Future ☆35 Updated 4 years ago
- ☆95 Updated 3 years ago
- Direct Attentive Dependency Parser ☆54 Updated last year
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer ☆39 Updated 5 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS ☆63 Updated 2 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP ☆107 Updated 3 years ago
- LongSumm - Scientific Document Summarization Task ☆74 Updated 3 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020) ☆121 Updated 4 years ago
- Superfast CUDA implementation of Word2Vec and Latent Dirichlet Allocation (LDA) ☆45 Updated 4 years ago
- Implementation for EACL 2021 paper "Scientific Discourse Tagging for Evidence Extraction". ☆20 Updated 4 years ago
- Code and material for the AllenNLP Guide ☆86 Updated 2 years ago
- ☆82 Updated 3 years ago
- Framework for information extraction from tables ☆41 Updated 6 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch ☆119 Updated 4 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF… ☆29 Updated 4 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ☆76 Updated 4 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆38 Updated 3 years ago
- Dataset accompanying the SPECTER model ☆142 Updated 2 years ago
- ☆40 Updated 4 years ago
- Converter from UD-trees to BART representation ☆36 Updated last year
- a large scientific paraphrase dataset for longer paraphrase generation ☆39 Updated 3 years ago
- ☆14 Updated 3 years ago
- ☆58 Updated 4 years ago
- A community-built high-quality repository of NLP corpora ☆63 Updated 3 years ago
- Train transformer-based models. ☆28 Updated this week