explosion / spacy-layoutLinks
π Process PDFs, Word documents and more with spaCy
β644Updated 3 months ago
Alternatives and similar repositories for spacy-layout
Users that are interested in spacy-layout are comparing it to the libraries listed below
Sorting:
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)β218Updated 2 weeks ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)β426Updated 8 months ago
- SpanMarker for Named Entity Recognitionβ434Updated 5 months ago
- A spaCy wrapper for GliNERβ116Updated 4 months ago
- Fast Semantic Text Deduplication & Filteringβ738Updated 3 weeks ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.β328Updated 2 weeks ago
- π¦ Integrating LLMs into structured NLP pipelinesβ1,267Updated 5 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β797Updated 4 months ago
- Simple package to extract text with coordinates from programmatic PDFsβ133Updated this week
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024β2,105Updated this week
- Extract structured text from pdfs quicklyβ497Updated 2 weeks ago
- A python library to define and validate data types in Docling.β148Updated this week
- Running Docling as an API serviceβ479Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,284Updated 2 weeks ago
- 𦦠weasel: A small and easy workflow systemβ84Updated 11 months ago
- β51Updated 3 months ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-sβ¦β216Updated 5 months ago
- Late Interaction Models Training & Retrievalβ452Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,459Updated 3 weeks ago
- β125Updated this week
- Software that makes labeling PDFs easy.β415Updated last year
- Efficiently find the best-suited language model (LM) for your NLP taskβ124Updated 3 weeks ago
- RAG (Retrieval-Augmented Generation) Chatbot Examples Using PyMuPDFβ959Updated last week
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,021Updated last week
- Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and croβ¦β815Updated 6 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipyβ1,213Updated 3 weeks ago
- Generalist and Lightweight Model for Text Classificationβ134Updated last week
- π©π»βπ³ A collection of example notebooks using Haystackβ482Updated last week
- Fast, Accurate, Lightweight Python library to make State of the Art Embeddingβ2,163Updated this week
- A very simple news crawler with a funny nameβ389Updated last week