deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆48 · Updated 6 months ago
Alternatives and similar repositories for notebooks
Users who are interested in notebooks compare it to the libraries listed below.
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆131 · Updated last year
- Highlight text in documents ☆110 · Updated 8 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆101 · Updated last year
- ☆22 · Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆64 · Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ☆68 · Updated last month
- Universal text classifier for generative models ☆25 · Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆50 · Updated last year
- ☆125 · Updated 9 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on… ☆48 · Updated last year
- Unstructured Data Connectors for Haystack 2.0 ☆17 · Updated 2 years ago
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-par… ☆65 · Updated 3 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆85 · Updated 11 months ago
- Explore the use of DSPy for extracting features from PDFs ☆49 · Updated last year
- ☆201 · Updated 2 weeks ago
- Mistral + Haystack: build RAG pipelines that rock ☆106 · Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆180 · Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆112 · Updated 2 years ago
- GLiNER model in a FastAPI microservice. ☆47 · Updated last year
- Create a music review RAG application with Neo4j ☆22 · Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆82 · Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 · Updated 2 years ago
- Python API for https://vespa.ai, the open big data serving engine ☆151 · Updated this week
- Build document-native LLM applications ☆55 · Updated last year
- A microframework for creating simple AI agents. ☆94 · Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆119 · Updated last week
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod… ☆20 · Updated 3 years ago
- Efficient few-shot learning with cross-encoders. ☆60 · Updated last year
- ☆74 · Updated last year
- ☆93 · Updated 2 years ago