h2oai / doctr
docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
☆11 Updated 3 months ago
Alternatives and similar repositories for doctr
Users who are interested in doctr compare it to the libraries listed below.
- ☆22 Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆79 Updated 3 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated 2 years ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆137 Updated 2 years ago
- Open source no-code system for text annotation and building of text classifiers ☆271 Updated 8 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 3 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆81 Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated last month
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆111 Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆80 Updated this week
- ☆249 Updated 3 years ago
- 📚 Datasets and models for instruction-tuning ☆238 Updated 2 years ago
- Large Language Model (LLM) Inference API and Chatbot ☆128 Updated last year
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod… ☆20 Updated 3 years ago
- Experiment and integrate with different OCR frameworks seamlessly ☆102 Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 Updated last year
- Label data using HuggingFace's transformers and automatically get a prediction service ☆193 Updated 2 years ago
- multimodal document analysis ☆166 Updated 2 months ago
- Custom recipe and utilities for document processing ☆200 Updated 3 years ago
- This PyTorch implementation of the LayoutLM paper by Microsoft demonstrates the SequenceClassification task using HuggingFace Transformers to cl… ☆35 Updated 3 years ago
- Table detection with Florence. ☆15 Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal … ☆32 Updated 4 years ago
- ☆13 Updated 2 years ago
- Chunk your text more accurately using gpt4o-mini ☆44 Updated last year
- Streamlit Named Entity Recognition (NER) annotation custom component ☆39 Updated 3 years ago
- Repository for deepdoctection tutorial notebooks ☆50 Updated last month
- Recognition of handwritten text using CRAFT text detection and TrOCR ☆26 Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes. ☆51 Updated 3 years ago
- ☆392 Updated 2 years ago