DS4SD / deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
☆197Updated 3 months ago
Alternatives and similar repositories for deepsearch-toolkit:
Users that are interested in deepsearch-toolkit are comparing it to the libraries listed below
- Examples using the Deep Search functionalities☆76Updated 3 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆48Updated 3 months ago
- Build document-native LLM applications☆53Updated 7 months ago
- A python library to define and validate data types in Docling.☆127Updated this week
- ☆113Updated 2 weeks ago
- Generalist and Lightweight Model for Text Classification☆124Updated last week
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆418Updated 7 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆206Updated 3 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆116Updated last month
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis☆338Updated 2 years ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆165Updated 7 months ago
- ☆121Updated 2 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆125Updated last year
- How to construct knowledge graphs from unstructured data sources☆125Updated 7 months ago
- TF-ID: Table/Figure IDentifier for academic papers☆231Updated 9 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆51Updated 6 months ago
- ☆180Updated 3 weeks ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆179Updated last year
- Schemas for WhyHow's automated knowledge graph creation SDK☆89Updated 8 months ago
- ☆74Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆421Updated last year
- multimodal document analysis☆164Updated 11 months ago
- A spaCy wrapper for GliNER☆114Updated 3 months ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆109Updated 9 months ago
- Python PDF parser for scientific publications: content and figures☆404Updated last year
- Python package that adds IntelligentGraph capabilities to RDFLib RDF graph package☆55Updated last year
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆49Updated 7 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆202Updated this week
- ☆101Updated last year
- ☆254Updated 5 months ago