metebalci / pdftitle
a utility to extract the title from a PDF file
☆139Updated this week
Alternatives and similar repositories for pdftitle:
Users that are interested in pdftitle are comparing it to the libraries listed below
- A turnkey command for converting a LaTeX source to ar5iv-style HTML☆59Updated 11 months ago
- PDF to XML ALTO file converter☆224Updated this week
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io☆132Updated 5 months ago
- A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.☆114Updated 3 months ago
- CLI for document conversion for scientific documents, powered by Mathpix OCR☆101Updated last year
- Logical structure analysis for visually structured documents☆86Updated 2 years ago
- Science-parse version 2☆235Updated 5 years ago
- Extract bibliographic references from (High-Energy Physics) articles.☆133Updated last month
- Python client for GROBID Web services☆308Updated 3 weeks ago
- Neuralized version of the Reference String Parser component of the ParsCit package.☆80Updated 2 years ago
- Simple, faithful BibTeX parser and algorithms for Python 3☆117Updated 11 months ago
- A tidy and complete archive of metadata for papers on arxiv.org, 1993-2019☆28Updated 5 years ago
- The Semantic Scholar Search Reranker☆104Updated 4 years ago
- Command line tool to extract figures, tables, and captions from scholarly documents in PDF form.☆130Updated 6 years ago
- multimodal document analysis☆162Updated 8 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- A general purpose processing framework for corpora of scientific documents☆58Updated 9 months ago
- Simple LaTeX parser providing latex-to-unicode and unicode-to-latex conversion☆336Updated 3 weeks ago
- A machine learning tool for fishing entities☆258Updated this week
- A Python tool kit for interacting with the locally hosted Zotero database.☆31Updated 3 years ago
- Improved version of Detex - tool for extracting plain text from TeX and LaTeX sources☆244Updated 3 months ago
- Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins …☆57Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆59Updated 9 months ago
- A python library/command-line tool to quickly and automatically generate BibTeX data starting from the pdf file of a scientific publicat…☆70Updated 6 months ago
- A python script for checking BibLatex .bib files for common referencing mistakes!☆176Updated last year
- AnyStyle Command Line Interface☆58Updated 2 years ago
- Cleans up your LaTeX files.☆151Updated last year
- arXiv plain text extraction☆41Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- JSON representation of the Zotero data model☆52Updated 2 weeks ago