lfoppiano / structure-visionLinks
Viewer for the structure extracted by Grobid on PDF documents
☆54Updated this week
Alternatives and similar repositories for structure-vision
Users that are interested in structure-vision are comparing it to the libraries listed below
Sorting:
- Streamlit PDF viewer☆176Updated this week
- multimodal document analysis☆166Updated last year
- Scientific Document Insight Q/A☆30Updated 2 weeks ago
- A spaCy wrapper for GliNER☆118Updated 7 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆214Updated 7 months ago
- Logical structure analysis for visually structured documents☆91Updated 3 years ago
- A Python library to chunk/group your texts based on semantic similarity.☆96Updated last year
- Examples using the Deep Search functionalities☆85Updated 7 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- PDF parser powered by grobid☆28Updated last year
- 🖍️ Highlight text in documents☆109Updated 4 months ago
- Robust and fast topic models with sentence-transformers.☆80Updated this week
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 6 months ago
- Streamlit Named Entity Recognition (NER) annotation custom component☆39Updated 2 years ago
- A Python Search Engine for Humans 🥸☆232Updated last year
- GLiNER model in a FastAPI microservice.☆45Updated 9 months ago
- Python client for GROBID Web services☆358Updated last week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- ☆192Updated last week
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆191Updated last week
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆70Updated 8 months ago
- Software that makes labeling PDFs easy.☆420Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆238Updated 3 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆368Updated last month
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆58Updated 2 years ago
- Zero and Few shot named entity & relationships recognition☆386Updated this week
- Python API for https://vespa.ai, the open big data serving engine☆141Updated this week
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆93Updated last year