deepdoctection / notebooksLinks
Repository for deepdoctection tutorial notebooks
β46Updated last month
Alternatives and similar repositories for notebooks
Users that are interested in notebooks are comparing it to the libraries listed below
Sorting:
- DocLLM: A layout-aware generative language model for multimodal document understandingβ128Updated last year
- ποΈ Highlight text in documentsβ109Updated 3 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β52Updated 9 months ago
- β22Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning modβ¦β20Updated 2 years ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β173Updated 10 months ago
- A Python library to chunk/group your texts based on semantic similarity.β97Updated last year
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- Pinecone text client libraryβ65Updated 4 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extractionβ76Updated last year
- Explore the use of DSPy for extracting features from PDFs πβ45Updated last year
- π Unstructured Data Connectors for Haystack 2.0β17Updated last year
- Efficient few-shot learning with cross-encoders.β56Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated 10 months ago
- GLiNER model in a FastAPI microservice.β45Updated 7 months ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPIβ115Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K β¦β83Updated 7 months ago
- β189Updated last month
- Generalist and Lightweight Model for Text Classificationβ148Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β66Updated 9 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. Iβ¦β113Updated 2 weeks ago
- Fully working applications that demonstrate how to use Haystack to implement various use casesβ124Updated 3 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.β68Updated 7 months ago
- β101Updated last year
- A spaCy wrapper for GliNERβ118Updated 6 months ago
- β122Updated 5 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β79Updated last year
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-parβ¦β60Updated 3 weeks ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.β26Updated last year
- Create a music review RAG application with Neo4jβ21Updated last year