deepdoctection / notebooksLinks
Repository for deepdoctection tutorial notebooks
☆45Updated 3 weeks ago
Alternatives and similar repositories for notebooks
Users that are interested in notebooks are comparing it to the libraries listed below
Sorting:
- DocLLM: A layout-aware generative language model for multimodal document understanding☆126Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆52Updated 9 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-par…☆60Updated last week
- ☆22Updated 3 months ago
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆42Updated 3 months ago
- Efficient few-shot learning with cross-encoders.☆54Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 11 months ago
- 🖍️ Highlight text in documents☆109Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆43Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆83Updated 6 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆97Updated last year
- Build document-native LLM applications☆53Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆49Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆67Updated 6 months ago
- Pandas-LLM☆46Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆105Updated 2 weeks ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆75Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- ☆27Updated last year
- GLiNER model in a FastAPI microservice.☆44Updated 7 months ago
- Lightweight Non-Parametric Embedding Fine-Tuning☆25Updated 9 months ago
- ☆22Updated last year
- Create a music review RAG application with Neo4j☆20Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆73Updated 8 months ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆115Updated last year
- Python API for https://vespa.ai, the open big data serving engine☆127Updated last week