Unstructured-IO / unstructured-inference
☆176Updated this week
Alternatives and similar repositories for unstructured-inference:
Users that are interested in unstructured-inference are comparing it to the libraries listed below
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆265Updated this week
- A Python library to chunk/group your texts based on semantic similarity.☆94Updated 8 months ago
- UniTable: Towards a Unified Table Foundation Model☆445Updated 9 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆489Updated last year
- A Python client for the Unstructured Platform API☆97Updated this week
- DocLLM: A layout-aware generative language model for multimodal document understanding☆123Updated last year
- ☆60Updated 11 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆413Updated last week
- ☆214Updated 3 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆235Updated 5 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆352Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆158Updated 6 months ago
- 🦜💯 Flex those feathers!☆242Updated 5 months ago
- ChatData 🔍 📖 brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million wikipedia pages and 2 milli…☆168Updated 4 months ago
- data cleaning and curation for unstructured text☆329Updated 7 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆173Updated 6 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆83Updated last week
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆288Updated 4 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆183Updated this week
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆171Updated 11 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆61Updated 2 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆316Updated 4 months ago
- Excel spreadsheet crawler and table parser for data extraction and querying☆130Updated 3 weeks ago
- An enterprise-grade AI retriever designed to streamline AI integration into your applications, ensuring cutting-edge accuracy.☆282Updated 2 weeks ago
- Using LlamaIndex, Redis, and OpenAI to chat with PDF documents. Supplementary material for blog post on Microsoft Developer Blog☆111Updated last year
- Visualize Different Text Splitting Methods☆234Updated 2 months ago
- ☆118Updated 3 weeks ago
- Python API for https://vespa.ai, the open big data serving engine☆115Updated this week