ad-freiburg / pdfact
A basic tool that extracts the structure from the PDF files of scientific articles.
☆74 Updated 3 years ago
Alternatives and similar repositories for pdfact
Users who are interested in pdfact are comparing it to the libraries listed below.
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 Updated last year
- link raw affiliation to ROR ids ☆30 Updated last year
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆66 Updated 4 years ago
- Open Access PDF harvester ☆40 Updated last year
- Scripts used to make and evaluate OpenAlex's concept tagging model ☆48 Updated last year
- Service for converting and enhancing heterogeneous publisher XML formats into TEI ☆54 Updated 7 months ago
- GROBID extension for identifying and normalizing physical quantities. ☆81 Updated 7 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆23 Updated 2 years ago
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io ☆137 Updated 7 months ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the Incorporated… ☆26 Updated 2 years ago
- A Named-Entity Recogniser based on Grobid. ☆52 Updated 7 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 Updated 2 years ago
- The Semantic Scholar Search Reranker ☆109 Updated 4 years ago
- Collection of Datasets for Legal Text Processing ☆101 Updated last year
- multimodal document analysis ☆164 Updated 11 months ago
- PDF to XML ALTO file converter ☆237 Updated this week
- LegalCrawler: A tool for automated scraping of English legal corpora ☆55 Updated 2 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature ☆24 Updated 7 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- ☆34 Updated last year
- Mining Legal Arguments in Court Decisions - Data and software ☆68 Updated last year
- ☆89 Updated 11 months ago
- ☆32 Updated 2 years ago
- A machine learning tool for fishing entities ☆264 Updated last month
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine. ☆73 Updated 8 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering) ☆17 Updated last year
- NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to … ☆36 Updated 2 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆70 Updated 3 years ago
- Find legal citations in any block of text ☆150 Updated this week
- Keeping It Simple is Hard ☆10 Updated last year