Unstructured-IO / unstructured.PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
☆30Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for unstructured.PaddleOCR
- Self-host LLMs with vLLM and BentoML☆72Updated last week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 2 weeks ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆27Updated 11 months ago
- ☆21Updated 5 months ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆37Updated 3 months ago
- ☆37Updated 11 months ago
- Data Questionnaire Agent Chatbot☆61Updated 3 weeks ago
- AI search: your data + 10 lines of code.☆73Updated 3 months ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆93Updated this week
- ☆27Updated 4 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- Dynamic Metadata based RAG Framework☆71Updated 3 months ago
- Develop, evaluate and monitor LLM applications at scale☆93Updated this week
- DSPY on action with OpenSource LLMs.☆54Updated 7 months ago
- Natural Language Interfaces Powered by LLMs☆91Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆59Updated last week
- A prompting library☆123Updated last month
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆54Updated 2 months ago
- Open source and AI-powered web search engine: local, private, dockerized and supported by a fluffy llama🦙☆51Updated 3 months ago
- ☆20Updated 9 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆62Updated this week
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated last year
- Embed anything.☆29Updated 5 months ago
- Retrieval of fully structured data made easy. Use LLMs or custom models. Specialized on PDFs and HTML files. Extensive support of tabular…☆54Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆42Updated 2 months ago
- Record and replay LLM interactions for langchain☆78Updated 4 months ago
- Python API for https://vespa.ai, the open big data serving engine☆101Updated this week
- A microframework for creating simple AI agents.☆88Updated 3 months ago
- Build reliable, secure, and production-ready AI apps easily.☆45Updated this week