butlerlabs / docai
DocAI helps developers quickly build document, image, and text processing pipelines using open-source and cloud-based machine learning models for a wide range of applications.
☆20 · Updated 2 years ago
Alternatives and similar repositories for docai
Users who are interested in docai are comparing it to the libraries listed below.
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆70 · Updated this week
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆14 · Updated 4 years ago
- ☆22 · Updated last year
- Repository for deepdoctection tutorial notebooks ☆46 · Updated last month
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0. ☆50 · Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆22 · Updated 10 months ago
- Pandas-LLM ☆46 · Updated 2 years ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆128 · Updated last year
- CRUD Word documents with Python ☆11 · Updated 8 months ago
- ☆47 · Updated 10 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆75 · Updated 9 months ago
- GLiNER model in a FastAPI microservice. ☆45 · Updated 7 months ago
- A chatbot made using the Chatterbot library in Python and locally hosted using Streamlit. Dataset used were collected during ConvAI2 comp… ☆15 · Updated 4 years ago
- Universal text classifier for generative models ☆24 · Updated last year
- Open-source, knowledge-grounded conversational assistant ☆13 · Updated last month
- Microsoft Phi 2 Streamlit App, deployed on HuggingFace Spaces is based on the Microsoft Phi 2 small language model (SLM) for text generat… ☆14 · Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization ☆23 · Updated last year
- Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format. ☆24 · Updated last year
- ☆13 · Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆17 · Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel… ☆12 · Updated last year
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters. ☆25 · Updated last year
- Input text or image, get back matching image fashion results, using Jina, DocArray, and CLIP ☆50 · Updated 2 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆19 · Updated 8 months ago
- Transforming textual descriptions into process models using deep learning ☆15 · Updated 6 years ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆86 · Updated 6 months ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆28 · Updated last year
- Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy ☆17 · Updated 2 years ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant ☆12 · Updated 2 years ago
- ☆40 · Updated 7 months ago