stanfordnlp/pdf-struct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stanfordnlp/pdf-struct)

stanfordnlp / pdf-struct

Logical structure analysis for visually structured documents

☆95

Alternatives and similar repositories for pdf-struct

Users that are interested in pdf-struct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChrizH / pdfstructure
View on GitHub
`pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.
☆106Apr 1, 2024Updated 2 years ago
IBM / retrieval-table-augmentation
View on GitHub
This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation"
☆12Sep 17, 2025Updated 9 months ago
MBAigner / PDFSegmenter
View on GitHub
This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…
☆23Sep 11, 2020Updated 5 years ago
KlausC / PkgVersion.jl
View on GitHub
Access `version`, `uuid`, etc. in `Project.toml`
☆13May 6, 2024Updated 2 years ago
ELS-RD / iaetdroit
View on GitHub
Annotation de la jurisprudence des CA Fr
☆12May 4, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Seben7 / text-match-cut
View on GitHub
Text Match Cut Video Generator Web App
☆37Feb 19, 2026Updated 4 months ago
IanButterworth / Darknet.jl
View on GitHub
Julia wrapper for AlexeyAB's fork of Darknet for YOLOV4/3/2 Object Detection
☆16Nov 24, 2025Updated 7 months ago
MGYamada / Handagote.jl
View on GitHub
Experimental forward-mode AD for tensor networks
☆14Dec 29, 2022Updated 3 years ago
JuliaCollections / LeftChildRightSiblingTrees.jl
View on GitHub
Memory-efficient representation of a tree with arbitrary number of children/node
☆16Jun 24, 2026Updated last week
ag-sc / lemon.dbpedia
View on GitHub
lemon lexicon for DBpedia
☆28Oct 13, 2015Updated 10 years ago
laborg / DocumenterEpub.jl
View on GitHub
EPUB Writer for Documenter.jl
☆20Jan 21, 2024Updated 2 years ago
kskyten / FromPython.jl
View on GitHub
☆23Sep 28, 2021Updated 4 years ago
thautwarm / DIO.jl
View on GitHub
Julia implementation for Python Restrain JIT
☆22Mar 3, 2021Updated 5 years ago
tanfiona / CauseEffectDetection
View on GitHub
Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".
☆13Feb 11, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
LeviBorodenko / img2rag
View on GitHub
Convert any image into a Region Adjacency Graph (RAG)
☆12Apr 27, 2020Updated 6 years ago
trusthlt / mining-legal-arguments
View on GitHub
Mining Legal Arguments in Court Decisions - Data and software
☆78May 15, 2023Updated 3 years ago
synalp / jtrans
View on GitHub
text-to-speech alignment java software
☆20Aug 25, 2019Updated 6 years ago
sahibpreetsingh12 / llm-learning
View on GitHub
☆15Jun 10, 2024Updated 2 years ago
JuliaPackaging / JLLPrefixes.jl
View on GitHub
Make yourself at home; JLLs are here to stay
☆23Feb 27, 2026Updated 4 months ago
krtab / ssccpp
View on GitHub
The Simple Switch Cases Configuration PreProcessor
☆12Mar 4, 2020Updated 6 years ago
TuanaCelik / unstructuredio-haystack
View on GitHub
💙 Unstructured Data Connectors for Haystack 2.0
☆18Sep 21, 2023Updated 2 years ago
jmccrae / wn-rdf
View on GitHub
WordNet RDF export
☆25Aug 4, 2017Updated 8 years ago
phamquiluan / table-transformer
View on GitHub
CVPR 2022: Table Structure Recognition
☆40Apr 19, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pygongnlp / CoSearchAgent
View on GitHub
[SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models
☆30Feb 15, 2024Updated 2 years ago
johannesloetzsch / nix-docker-cljc
View on GitHub
reproducible dev+test+production environments for java+javascript+clojure(script)
☆13Feb 2, 2021Updated 5 years ago
rickyang1114 / multimodal-deepresearcher
View on GitHub
[AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework
☆57Jun 8, 2026Updated 3 weeks ago
NilsBarlaug / lemon
View on GitHub
LEMON: Explainable Entity Matching
☆19Apr 6, 2022Updated 4 years ago
d3plus / d3plus-shape
View on GitHub
Fancy SVG shapes for visualizations
☆20Apr 23, 2024Updated 2 years ago
OCR-D / ocrd_anybaseocr
View on GitHub
DFKI Layout Detection for OCR-D
☆47May 1, 2025Updated last year
eliask / pdfssa4met
View on GitHub
PDF Structure and Syntactic Analysis for Metadata Extraction and Tagging - https://code.google.com/p/pdfssa4met/
☆19Mar 6, 2013Updated 13 years ago
deadbits / vector-embedding-api
View on GitHub
Flask API for generating text embeddings using OpenAI or sentence_transformers
☆14Sep 1, 2023Updated 2 years ago
GunnarFarneback / DynamicallyLoadedEmbedding.jl
View on GitHub
Embed Julia with dynamical loading of libjulia at runtime.
☆18Dec 5, 2025Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
AtsushiSakai / SciPy.jl
View on GitHub
Julia interface for SciPy
☆28Dec 5, 2022Updated 3 years ago
MaxHalford / pointu
View on GitHub
Pointillisme tool based on Weighted Voronoi Stippling
☆37Mar 3, 2020Updated 6 years ago
re-Isearch / re-Isearch
View on GitHub
Open Source re-Isearch Project
☆19Jun 4, 2026Updated 3 weeks ago
vemonet / json-ld-editor-react
View on GitHub
🧙‍♂️📝 JSON-LD web editor, with autocomplete based on the loaded ontologies concepts and properties
☆15Apr 22, 2023Updated 3 years ago
sh0416 / oommix
View on GitHub
Official implementation for ACL2021 Oral Paper: "OoMMix: Out-of-manifold Regularization in Contextual Embedding Space for Text Classifica…
☆13May 24, 2021Updated 5 years ago
mmohsinkhan / cre
View on GitHub
Cython based high performance alternative to Python (re) module for doing basic pattern matching on large data-set..
☆11Dec 15, 2022Updated 3 years ago
KMCS-NII / PDFNLT-1.0
View on GitHub
Tools for Natural Language Text aware PDF structure analysis
☆15Mar 11, 2022Updated 4 years ago