DS4SD/deepsearch-toolkit

Interact with the Deep Search platform for new knowledge explorations and discoveries

☆228

Alternatives and similar repositories for deepsearch-toolkit

Users who are interested in deepsearch-toolkit are comparing it to the libraries listed below.

DS4SD / deepsearch-examples
Examples using the Deep Search functionalities
☆90 · Jan 29, 2025 · Updated last year
DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆60 · Jan 27, 2025 · Updated last year
DS4SD / quackling
Build document-native LLM applications
☆58 · Sep 11, 2024 · Updated last year
DS4SD / DocLayNet
DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
☆450 · Feb 1, 2023 · Updated 3 years ago
docling-project / docling-haystack
Docling Haystack integration
☆29 · Apr 9, 2026 · Updated 3 months ago
DS4SD / PatCID
[Nat. Commun.] PatCID: an open-access dataset of chemical structures in patent documents
☆75 · Oct 27, 2025 · Updated 8 months ago
DS4SD / SemTabNet
Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
☆17 · Jul 1, 2024 · Updated 2 years ago
docling-project / docling-parse
Simple package to extract text with coordinates from programmatic PDFs
☆326 · Updated this week
docling-project / docling-operator
☆16 · Apr 8, 2026 · Updated 3 months ago
DS4SD / MarkushGenerator
[CVPR 25] MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures
☆15 · Mar 22, 2026 · Updated 4 months ago
IBM / bob-demo
A playground of fun, bite-sized IBM Bob demos. Because learning works better when you push it.
☆35 · May 7, 2026 · Updated 2 months ago
davidberenstein1957 / spacy-setfit
This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.
☆84 · Aug 31, 2023 · Updated 2 years ago
data-prep-kit / data-prep-kit
Open source project for data preparation for GenAI applications
☆949 · Jul 14, 2026 · Updated last week
IBM / Bridge-Operator
Bridge operator repo
☆22 · Sep 17, 2025 · Updated 10 months ago
JoelNiklaus / LegalDatasets
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆19 · Jun 16, 2023 · Updated 3 years ago
IBM / hspo-ontology
Ontology representing a 360-view of a person (or cohort) that spans across multiple domains, from health to social.
☆35 · Sep 17, 2025 · Updated 10 months ago
IBM / SynthTabNet
Dataset of PNG images from synthetically generated table layouts with annotations in JSONL files
☆154 · Sep 17, 2025 · Updated 10 months ago
IBM / watsonx-ai-platform-demos
AI Agents, LLM Fine-tuning, Developer Productivity, Governance, IBM watsonx
☆52 · Jan 7, 2026 · Updated 6 months ago
IBM / eval-assist
EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other la…
☆102 · Apr 9, 2026 · Updated 3 months ago
slimgroup / Devito4PyTorch
Integrating Devito operators into PyTorch
☆13 · Mar 17, 2021 · Updated 5 years ago
Qiskit / qiskit-ibm-transpiler
A library to use the Qiskit Transpiler Service and the AI-powered transpiler passes.
☆39 · Mar 10, 2026 · Updated 4 months ago
watson-developer-cloud / watsonx-orchestrate-developer-toolkit
☆14 · Apr 27, 2026 · Updated 2 months ago
ROCm / RIXL
DEPRECATED REPOSITORY. ROCm Inference Transfer Library (RIXL) is a port of the NIXL library for AMD GPUs. See README_rocm.md for AMD spe…
☆15 · Jun 10, 2026 · Updated last month
rhoai-mlops / lab-instructions
This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/
☆16 · Jun 18, 2026 · Updated last month
EFS-OpenSource / superb-data-kraken-organizationmanager
Designed for managing organizations and spaces within the SDK. Organizations group use-cases, while Spaces represent individual use-cases…
☆10 · Dec 14, 2023 · Updated 2 years ago
opencitations / wcw
Wikipedia Citations in Wikidata
☆10 · May 6, 2021 · Updated 5 years ago
rahulnyk / fountain-pen
☆14 · Aug 9, 2024 · Updated last year
aidatatools / open-deepsearch
open-deepsearch
☆11 · Mar 3, 2025 · Updated last year
doclang-project / doclang
DocLang spec and reference toolkit
☆519 · Jul 15, 2026 · Updated last week
IBM / opensource-ai-workshop
Open Source AI with Granite and Granite Code
☆27 · Oct 6, 2025 · Updated 9 months ago
GT4SD / molgx-core
IBM Molecule Generation Experience (MolGX) is a tool to accelerate an AI-driven design of new materials.
☆16 · Oct 26, 2022 · Updated 3 years ago
jharmison-redhat / openshift-setup
☆19 · Updated this week
instructlab / taxonomy
Taxonomy tree that will allow you to create models tuned with your data
☆299 · Sep 8, 2025 · Updated 10 months ago
tobischimanski / pdfQA
Benchmarking QA systems on PDFs
☆17 · Feb 20, 2026 · Updated 5 months ago
intellectronica / battle-of-the-semantics
GraphRag vs Embeddings
☆16 · Jul 14, 2024 · Updated 2 years ago
GT4SD / zero-shot-bert-adapters
Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.
☆44 · Jun 13, 2023 · Updated 3 years ago
PaccMann / paccmann_rl
Code pipeline for the PaccMann^RL in iScience: https://www.cell.com/iscience/fulltext/S2589-0042(21)00237-6
☆34 · Feb 10, 2022 · Updated 4 years ago
PaccMann / paccmann_datasets
pytoda - PaccMann PyTorch Dataset Classes. Read the docs: https://paccmann.github.io/paccmann_datasets/
☆29 · Dec 13, 2025 · Updated 7 months ago
alexa / ramen
A software for transferring pre-trained English models to foreign languages
☆20 · Mar 20, 2023 · Updated 3 years ago