lfoppiano / structure-visionLinks
Viewer for the structure extracted by Grobid on PDF documents
☆55Updated 2 weeks ago
Alternatives and similar repositories for structure-vision
Users that are interested in structure-vision are comparing it to the libraries listed below
Sorting:
- Streamlit PDF viewer☆188Updated 2 weeks ago
- Logical structure analysis for visually structured documents☆93Updated 3 years ago
- multimodal document analysis☆166Updated last week
- Scientific Document Insight Q/A☆31Updated 2 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆99Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- A Streamlit app for showing a TimelineJS about the history of Natural Language Processing☆29Updated 2 years ago
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike).☆58Updated 2 years ago
- ☆199Updated 2 weeks ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆98Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆81Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆76Updated 3 years ago
- Python API for https://vespa.ai, the open big data serving engine☆147Updated last week
- A Python Search Engine for Humans 🥸☆238Updated last year
- GLiNER model in a FastAPI microservice.☆45Updated 11 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- ☆23Updated 7 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆219Updated 9 months ago
- Repository for deepdoctection tutorial notebooks☆46Updated 5 months ago
- A spaCy wrapper for GliNER☆124Updated 9 months ago
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆107Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆193Updated last week
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆72Updated 10 months ago
- Benchmarking PDF libraries☆315Updated 4 months ago
- A Streamlit component for annotating text by text selecting.☆42Updated last year
- Streamlit Named Entity Recognition (NER) annotation custom component☆39Updated 3 years ago
- The Semantic Scholar Search Reranker☆107Updated 5 years ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆113Updated last year