lfoppiano / structure-vision
Viewer for the structure extracted by Grobid on PDF documents
☆57 · Updated 2 months ago
Alternatives and similar repositories for structure-vision
Users who are interested in structure-vision are comparing it to the libraries listed below
- Scientific Document Insight Q/A ☆33 · Updated 4 months ago
- Streamlit PDF viewer ☆195 · Updated this week
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆220 · Updated last year
- Logical structure analysis for visually structured documents ☆93 · Updated 3 years ago
- GLiNER model in a FastAPI microservice. ☆47 · Updated last year
- Examples using the Deep Search functionalities ☆83 · Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆76 · Updated 4 years ago
- A spaCy wrapper for GliNER ☆129 · Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆53 · Updated 10 months ago
- Annotation meets Large Language Models (ChatGPT, GPT-3 and alike). ☆58 · Updated 2 years ago
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for … ☆99 · Updated 2 years ago
- Repository for deepdoctection tutorial notebooks ☆50 · Updated 3 weeks ago
- A Python library to chunk/group your texts based on semantic similarity. ☆103 · Updated last year
- ☆201 · Updated this week
- multimodal document analysis ☆166 · Updated 2 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A. ☆58 · Updated last year
- Efficient few-shot learning with cross-encoders. ☆61 · Updated last year
- 🖍️ Highlight text in documents ☆111 · Updated 9 months ago
- Robust and fast topic models with sentence-transformers. ☆89 · Updated last week
- Python API for https://vespa.ai, the open big data serving engine ☆158 · Updated this week
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆73 · Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆201 · Updated this week
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆111 · Updated last year
- A Streamlit app for showing a TimelineJS about the history of Natural Language Processing ☆29 · Updated 2 years ago
- Docling core data types and transformations ☆223 · Updated this week
- Simple package to extract text with coordinates from programmatic PDFs ☆236 · Updated this week
- ☆55 · Updated 2 years ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆137 · Updated 2 years ago
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea… ☆60 · Updated 11 months ago
- Generalist and Lightweight Model for Text Classification ☆169 · Updated this week