butlerlabs / docai
DocAI helps developers quickly build document, image, and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications.
☆20Updated 2 years ago
Alternatives and similar repositories for docai
Users that are interested in docai are comparing it to the libraries listed below
- ☆22Updated last year
- Repository for deepdoctection tutorial notebooks☆45Updated last week
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆14Updated 4 years ago
- ☆11Updated last year
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆36Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- Scripts for reading, extracting, and organizing data from either HTML or PDF documents and preparing them to be converted into embeddings f…☆13Updated 10 months ago
- Tool to take your ML model from local to production with one line of code.☆25Updated last year
- CRUD Word documents with Python☆11Updated 7 months ago
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆23Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆17Updated last month
- This project provides a pipeline for deploying and performing inference with the YOLOv8 object detection model using the Triton Inference…☆15Updated 2 months ago
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat…☆14Updated last year
- A streaming Markdown component for Streamlit with LaTeX, Mermaid, table, and code support. A drop-in replacement for st.markdown.☆20Updated 4 months ago
- ☆15Updated 4 years ago
- ☆11Updated last month
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated 10 months ago
- Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy☆17Updated 2 years ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP☆50Updated 2 years ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- Pandas-LLM☆46Updated last year
- Explore new advancements like ChatGPT’s function calling capability, and build a conversational agent using a new syntax called LangChain…☆15Updated last year
- Online Inference API for NLP Transformer models - summarization, text classification, sentiment analysis and more☆44Updated last year
- Nougat is Meta AI's OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆23Updated last year