butlerlabs / docaiLinks
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
☆20Updated 2 years ago
Alternatives and similar repositories for docai
Users that are interested in docai are comparing it to the libraries listed below
Sorting:
- ☆22Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- Repository for deepdoctection tutorial notebooks☆45Updated 2 weeks ago
- ☆15Updated 4 years ago
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆48Updated last week
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆14Updated 4 years ago
- CRUD Word documents with Python☆11Updated 7 months ago
- 📃 A contracts clause summarization system using LLM and vector database☆18Updated 4 months ago
- Automated PDF and text processing with Spacy and NLTK; information extraction from text based on grammatical structure; deployed on extra…☆16Updated 3 years ago
- ☆11Updated last year
- This project provides a pipeline for deploying and performing inference with the YOLOv8 object detection model using the Triton Inference…☆15Updated 2 months ago
- Search PDFs using Jina, DocArray and Jina Hub☆56Updated 3 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated last year
- Web application that allows you to interact with biomedical knowledge graphs and query biomedical questions.☆32Updated last year
- ☆22Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- ☆13Updated 2 months ago
- Pandas-LLM☆46Updated last year
- ☆13Updated 3 years ago
- ☆11Updated 2 years ago
- Open-source, knowledge-grounded conversational assistant☆13Updated this week
- Nougat is a Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆24Updated last year
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- AI_Powered_Dev_Search_Engine☆12Updated last year
- Solve Geometric & Graph Problems with Large Language Models☆29Updated 2 years ago
- Tool to take your ML model from local to production with one-line of code.☆25Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆18Updated 7 months ago
- This repository serves as a collection of scrapers procuring and structuring various legal datasets☆17Updated 2 years ago