Unstructured-IO / unstructured.PaddleOCRLinks
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
☆40Updated 8 months ago
Alternatives and similar repositories for unstructured.PaddleOCR
Users that are interested in unstructured.PaddleOCR are comparing it to the libraries listed below
Sorting:
- Open-source observability for your LLM application.☆53Updated 11 months ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆78Updated last month
- An open-source cloud-native of large multi-modal models (LMMs) serving framework.☆164Updated 2 years ago
- Pipeline for converting PDFs to raw text with PaddleOCR☆23Updated 2 years ago
- Elasticsearch integration into LangChain☆68Updated last week
- Self-host LLMs with vLLM and BentoML☆161Updated 2 weeks ago
- Develop, evaluate and monitor LLM applications at scale☆98Updated last year
- ☆199Updated 2 weeks ago
- simplifies the process of creating and managing LLM workflows.☆112Updated last year
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆57Updated last year
- Data extraction with Donut ML model☆57Updated last year
- Private ChatGPT/Perplexity. Securely unlocks knowledge from confidential business information.☆77Updated last year
- Build document-native LLM applications☆54Updated last year
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…☆177Updated last year
- Turn any OCR models into online inference API endpoint 🚀 🌖☆57Updated last month
- ☆66Updated 8 months ago
- Self-host llmapi server, make it really easy for accessing LLMs !☆37Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆77Updated this week
- Run AI generated code in isolated sandboxes☆126Updated 10 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆44Updated 11 months ago
- An experimental and alternative approach to Finetuning and RAG.☆34Updated 2 years ago
- Open Source Text Embedding Models with OpenAI Compatible API☆164Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 8 months ago
- ☆20Updated 10 months ago
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.☆51Updated last year
- Unattended Lightweight Text Classifiers with LLM Embeddings☆186Updated last year
- Split and analyze text files using langchain and streamlit☆50Updated last year
- Run LLM-related tools in containers.☆55Updated last year
- FalkorDB-Browser is a visualization UI for FalkorDB.☆76Updated last week
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆292Updated 5 months ago