deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆44 Updated 5 months ago
Alternatives and similar repositories for notebooks:
Users who are interested in notebooks are comparing it to the libraries listed below.
- ☆22 Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆125 Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆49 Updated 7 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 Updated 6 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆39 Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on… ☆44 Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆13 Updated 8 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆30 Updated 3 weeks ago
- ☆45 Updated 7 months ago
- Efficient few-shot learning with cross-encoders. ☆51 Updated last year
- ☆14 Updated last year
- Generalist and Lightweight Model for Text Classification ☆124 Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 8 months ago
- Lightweight Non-Parametric Embedding Fine-Tuning ☆25 Updated 7 months ago
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype ☆22 Updated last year
- Universal text classifier for generative models ☆24 Updated 9 months ago
- Contains Google Colab or Jupyter notebooks, as well as other associated files for my Medium blogposts. ☆35 Updated 11 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 6 months ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆25 Updated last year
- Pandas-LLM ☆44 Updated last year
- 🖍️ Highlight text in documents ☆107 Updated 2 weeks ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆50 Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ☆30 Updated 8 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆71 Updated 9 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated 4 months ago
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph ☆57 Updated last year
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆24 Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech. Updated last year