zanibbi / SymbolScraper
Apache PDFBox extension for precisely extracting character/symbol locations and identities from born-digital PDF files.
★19 Updated 3 weeks ago
Alternatives and similar repositories for SymbolScraper
Users who are interested in SymbolScraper are comparing it to the repositories listed below.
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" ★142 Updated 3 years ago
- ★94 Updated 3 years ago
- Workshop Home Page for Benchmarking: Past, Present and Future ★35 Updated 4 years ago
- LongSumm - Scientific Document Summarization Task ★74 Updated 3 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122 ★136 Updated last year
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ★76 Updated 4 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF… ★29 Updated 3 years ago
- Hyperparameter Search for AllenNLP ★139 Updated 7 months ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS ★63 Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗 Transformers ★49 Updated 2 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020) ★121 Updated 4 years ago
- Dataset accompanying the SPECTER model ★139 Updated 2 years ago
- ★17 Updated 2 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ★117 Updated 4 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ★180 Updated 2 years ago
- ★58 Updated 4 years ago
- Code and material for the AllenNLP Guide ★86 Updated 2 years ago
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP ★107 Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ★88 Updated 4 months ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction ★120 Updated 4 years ago
- Question-answers, collected from Google ★128 Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ★147 Updated 4 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ★71 Updated 2 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch ★118 Updated 4 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters" ★201 Updated 2 years ago
- Direct Attentive Dependency Parser ★54 Updated last year
- Code for pre-training CharacterBERT models (as well as BERT models). ★34 Updated 4 years ago
- ★56 Updated 3 years ago
- ★75 Updated 4 years ago
- ★14 Updated 3 years ago