butlerlabs / docaiLinks
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
☆20Updated 2 years ago
Alternatives and similar repositories for docai
Users that are interested in docai are comparing it to the libraries listed below
Sorting:
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus☆14Updated 4 years ago
- ☆22Updated last year
- Search PDFs using Jina, DocArray and Jina Hub☆56Updated 3 years ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- A chatbot made using the Chatterbot library in Python and locally hosted using Streamlit. Dataset used were collected during ConvAI2 comp…☆15Updated 4 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated last week
- Repository for deepdoctection tutorial notebooks☆46Updated 3 months ago
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆37Updated 2 years ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.☆50Updated last week
- Pandas-LLM☆46Updated 2 years ago
- ☆12Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- ☆14Updated last year
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy☆17Updated 2 years ago
- A Streamlit app for showing a TimelineJS about the history of Natural Language Processing☆29Updated last year
- Query, ask and chat with a document-index via transformer models!☆17Updated 2 years ago
- CRUD Word documents with Python☆12Updated 10 months ago
- Streamlit component for Jina neural search☆42Updated 3 years ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- Integrated LLM-based document and data Q&A with knowledge graph visualization☆22Updated last year
- ☆50Updated last year
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated last year
- GLiNER model in a FastAPI microservice.☆45Updated 9 months ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- Using ChatGPT to build a Kedro ML pipeline and Streamlit frontend☆30Updated 2 years ago
- Full-fledged Data Exploration Tool for Label Studio☆48Updated last year
- High-Performance Transformers for Table Structure Recognition Need Early Convolutions☆44Updated last year
- Universal text classifier for generative models☆25Updated last year