butlerlabs / docaiLinks
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
☆20Updated 2 years ago
Alternatives and similar repositories for docai
Users that are interested in docai are comparing it to the libraries listed below
Sorting:
- ☆22Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- AI_Powered_Dev_Search_Engine☆12Updated last year
- ☆11Updated 2 years ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 11 months ago
- Repository for deepdoctection tutorial notebooks☆45Updated 2 weeks ago
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- ☆13Updated last year
- Universal text classifier for generative models☆24Updated 11 months ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆17Updated last week
- GLiNER model in a FastAPI microservice.☆44Updated 6 months ago
- ☆15Updated 4 years ago
- This project provides a pipeline for deploying and performing inference with the YOLOv8 object detection model using the Triton Inference…☆15Updated 2 months ago
- a streaming markdown component for streamlit with LaTeX, Mermaid, Table, code support. A drop-in replacement for st.markdown.☆20Updated 4 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- CTE: Contextualized Table Extraction Dataset☆17Updated 2 years ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Updated 3 years ago
- Scripts for reading, extracting, and organizing data from either HTML or PDF documents and prepare them to be converted into embeddings f…☆13Updated 10 months ago
- Automated PDF and text processing with Spacy and NLTK; information extraction from text based on grammatical structure; deployed on extra…☆16Updated 3 years ago
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated last year
- ☆47Updated 9 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- CRUD Word documents with Python☆11Updated 7 months ago
- Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆24Updated last year
- An agent to generate stunning images :)☆21Updated last month
- ☆41Updated 6 months ago
- ☆14Updated last year