deepdoctection / notebooks
Repository for deepdoctection tutorial notebooks
☆48 Updated 2 weeks ago
Alternatives and similar repositories for notebooks
Users who are interested in notebooks are comparing it to the repositories listed below.
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆133 Updated 2 years ago
- Highlight text in documents ☆111 Updated 8 months ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod… ☆20 Updated 3 years ago
- Explore the use of DSPy for extracting features from PDFs ☆49 Updated last year
- ☆22 Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆86 Updated last year
- A Python library to chunk/group your texts based on semantic similarity. ☆102 Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆64 Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- Python API for https://vespa.ai, the open big data serving engine ☆154 Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ☆69 Updated last month
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆183 Updated last year
- Efficient few-shot learning with cross-encoders. ☆61 Updated last year
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ☆120 Updated 3 weeks ago
- Generalist and Lightweight Model for Text Classification ☆166 Updated last month
- ☆104 Updated last year
- GLiNER model in a FastAPI microservice. ☆47 Updated last year
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-par… ☆66 Updated 2 weeks ago
- Datasets and models for instruction-tuning ☆238 Updated 2 years ago
- Universal text classifier for generative models ☆24 Updated last year
- Synthetic Text Dataset Generation for LLM projects ☆55 Updated last month
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 Updated last year
- ☆75 Updated last year
- ☆200 Updated this week
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆34 Updated 4 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆45 Updated last year
- ☆125 Updated 10 months ago
- Create a music review RAG application with Neo4j ☆22 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆83 Updated last year