elifesciences / sciencebeam-parser
View external linksLinks

A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document.

☆296

Alternatives and similar repositories for sciencebeam-parser

Users that are interested in sciencebeam-parser are comparing it to the libraries listed below

Sorting:

elifesciences / sciencebeam-gym
View on GitHub
ScienceBeam Gym
☆25Mar 29, 2022Updated 3 years ago
CeON / CERMINE
View on GitHub
Content ExtRactor and MINEr
☆512Jun 30, 2022Updated 3 years ago
grobidOrg / grobid
View on GitHub
A machine learning software for extracting information from scholarly documents
☆4,630Feb 6, 2026Updated last week
ckorzen / icecite
View on GitHub
The repository of Icecite, a research paper management system.
☆15Mar 29, 2018Updated 7 years ago
PRImA-Research-Lab / prima-aletheia-web-emop
View on GitHub
Web-based page layout editor created for EMOP (Early Modern OCR Project).
☆11May 21, 2021Updated 4 years ago
OpenJournal / central
View on GitHub
Universalizing Open-Access Journals & Papers
☆19Mar 8, 2017Updated 8 years ago
allenai / science-parse
View on GitHub
Science Parse parses scientific papers (in PDF form) and returns them in structured form.
☆694May 26, 2024Updated last year
Authorea / texstyles
View on GitHub
Authorea's collection of LaTeX-based export styles for scholarly writing
☆20Sep 19, 2016Updated 9 years ago
essepuntato / opencitations
View on GitHub
OpenCitations provides in RDF accurate citation information harvested from the scholarly literature.
☆68Feb 19, 2018Updated 7 years ago
oaworks / plugin
View on GitHub
The One True Open Access Button - cross-compatible extension for research papers and data.
☆49Oct 8, 2024Updated last year
allenai / aristo-mini
View on GitHub
Aristo mini is a light-weight question answering system that can quickly evaluate Aristo science questions with an evaluation web server …
☆96Oct 31, 2018Updated 7 years ago
internetarchive / fatcat
View on GitHub
Perpetual Access To The Scholarly Record
☆120Jul 31, 2024Updated last year
elifesciences / lens
View on GitHub
A novel way of viewing eLife articles.
☆378Apr 21, 2022Updated 3 years ago
CrossRef / pdfextract
View on GitHub
MOVED TO https://gitlab.com/crossref/pdfextract
☆510Jul 26, 2017Updated 8 years ago
oeg-upm / morph-csv
View on GitHub
Enhancing virtual KG access over tabular data with RML and CSVW
☆12Jan 7, 2023Updated 3 years ago
allenai / brat
View on GitHub
brat rapid annotation tool (brat) - for all your textual annotation needs
☆10Feb 3, 2018Updated 8 years ago
breck7 / pau
View on GitHub
Medical records you can copy and paste
☆12Mar 3, 2023Updated 2 years ago
cytoscape / cytoscape.js-hierarchical
View on GitHub
A Cytoscape.js extension for the hierarchical clustering algorithm
☆10Jul 26, 2017Updated 8 years ago
brett-lempereur / theme-base16grayscale
View on GitHub
Light and dark variants for Visual Studio Code of the Base16 Grayscale theme by Chris Kempson
☆10May 11, 2017Updated 8 years ago
allenai / pdffigures2
View on GitHub
Given a scholarly PDF, extract figures, tables, captions, and section titles.
☆725Mar 10, 2024Updated last year
elifesciences / elife-article-xml
View on GitHub
Full XML of all eLife articles including each revision.
☆27Feb 6, 2026Updated last week
BMKEG / lapdftextProject
View on GitHub
High-level build project for all LAPDF-Text submodules
☆103Jul 2, 2015Updated 10 years ago
zorba-processor / zorba
View on GitHub
Zorba - the NoSQL processor
☆42Dec 13, 2023Updated 2 years ago
opencitations / croci
View on GitHub
Repository of the Crowdsourced Open Citations Index (CROCI)
☆10Mar 19, 2019Updated 6 years ago
AlgebraicJulia / CSetAutomorphisms.jl
View on GitHub
Automorphism groups for CSets - generalizing the nauty algorithm to a broad class of data structures
☆14Oct 30, 2023Updated 2 years ago
Vitaliy-1 / JATSParser
View on GitHub
JATSParser is aimed to be integrated with Open Journal Systems 3.0+ for transforming JATS XML to various formats
☆13Apr 15, 2024Updated last year
ourresearch / journalsdb
View on GitHub
Open database of scholarly journals
☆10Oct 26, 2022Updated 3 years ago
seuretm / ocrd_typegroups_classifier
View on GitHub
☆10Mar 16, 2023Updated 2 years ago
explosion / curated-tokenizers
View on GitHub
Lightweight piece tokenization library
☆12Apr 15, 2024Updated last year
HazyResearch / pdftotree
View on GitHub
A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.
☆461Aug 3, 2023Updated 2 years ago
jawline / PiFC
View on GitHub
Building a Raspberry Pi flight controller
☆12Nov 30, 2015Updated 10 years ago
PRImA-Research-Lab / prima-core-libs
View on GitHub
Core libraries by the PRImA Research Lab
☆16Jul 30, 2024Updated last year
vineetjohn / research-review-notes
View on GitHub
Research Paper Review Notes
☆13Oct 26, 2018Updated 7 years ago
m00nlight / minizinc-mode
View on GitHub
Emacs mode for editing MiniZinc model file
☆12Apr 26, 2023Updated 2 years ago
istex-archives / istex-browser-extension
View on GitHub
Bouton ISTEX : extension web capable d'insérer dynamiquement sur la page web consultée un lien vers le fulltext d'un document si ce dern…
☆11May 30, 2023Updated 2 years ago
trevorld / r-getopt
View on GitHub
R package providing basic command line optional argument parsing
☆12Oct 1, 2023Updated 2 years ago
Gorov / FCT_PhraseSim_TACL
View on GitHub
☆12Jul 24, 2017Updated 8 years ago
breck7 / sleepytimeconference
View on GitHub
The conference that comes together while you sleep.
☆17Feb 12, 2021Updated 5 years ago
xml-director / xmldirector.plonecore
View on GitHub
XML Director - XML Content Management
☆16Jan 11, 2024Updated 2 years ago

elifesciences / sciencebeam-parserView external linksLinks

Alternatives and similar repositories for sciencebeam-parser

elifesciences / sciencebeam-parser
View external linksLinks