DS4SD / deepsearch-toolkit
Interact with the Deep Search platform for new knowledge explorations and discoveries
☆135Updated last month
Related projects ⓘ
Alternatives and complementary repositories for deepsearch-toolkit
- Examples using the Deep Search functionalities☆47Updated 3 months ago
- ☆43Updated this week
- A python library to define and validate data types in Docling.☆34Updated this week
- Build document-native LLM applications☆51Updated 2 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆116Updated this week
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆146Updated this week
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆79Updated 10 months ago
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆24Updated last month
- Scientific Document Insight Q/A☆23Updated this week
- Generalist and Lightweight Model for Text Classification☆51Updated last week
- A spaCy wrapper for GliNER☆91Updated 4 months ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆55Updated 6 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆72Updated last year
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆35Updated 4 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆40Updated 2 weeks ago
- Let's build better datasets, together!☆206Updated this week
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆19Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆62Updated 8 months ago
- End-to-end zero-shot entity and relation extraction☆58Updated 3 months ago
- ☆82Updated 6 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆348Updated 7 months ago
- Open source project for data preparation of LLM application builders☆312Updated this week
- a unified framework for leveraging LLMs☆58Updated this week
- ☆46Updated 9 months ago
- SpanMarker for Named Entity Recognition☆404Updated 4 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆110Updated 4 months ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)☆340Updated last month
- Mistral + Haystack: build RAG pipelines that rock 🤘☆100Updated 9 months ago