DS4SD / deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
☆213 · Updated 7 months ago
Alternatives and similar repositories for deepsearch-toolkit
Users who are interested in deepsearch-toolkit are comparing it to the libraries listed below.
- Examples using the Deep Search functionalities ☆85 · Updated 7 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A. ☆55 · Updated 7 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆128 · Updated last year
- Build document-native LLM applications ☆54 · Updated 11 months ago
- Generalist and Lightweight Model for Text Classification ☆156 · Updated 2 months ago
- A python library to define and validate data types in Docling. ☆173 · Updated this week
- ☆191 · Updated last week
- ☆137 · Updated last month
- Simple package to extract text with coordinates from programmatic PDFs ☆176 · Updated last week
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆69 · Updated 8 months ago
- ☆122 · Updated 6 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) ☆236 · Updated 2 months ago
- Python API for https://vespa.ai, the open big data serving engine ☆137 · Updated this week
- DSPy in action with open-source LLMs. ☆75 · Updated last year
- General solution to archetype LLM batch use case ☆35 · Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆232 · Updated 3 weeks ago
- GLiNER model in a FastAPI microservice. ☆45 · Updated 8 months ago
- 🧪 Experimental features for Haystack ☆48 · Updated last week
- TF-ID: Table/Figure IDentifier for academic papers ☆238 · Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆174 · Updated 11 months ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python. ☆69 · Updated last year
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models" ☆42 · Updated 4 months ago
- Repository for deepdoctection tutorial notebooks ☆46 · Updated 2 months ago
- Construct knowledge graphs from unstructured data sources, use graph algorithms for enhanced GraphRAG with a BAML-based chat bot, and cur… ☆164 · Updated this week
- ☆210 · Updated 2 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆97 · Updated last year
- Scientific Document Insight Q/A ☆30 · Updated 2 months ago
- multimodal document analysis ☆164 · Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆64 · Updated 10 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆211 · Updated 3 months ago