lfoppiano / document-qa
Scientific Document Insight Q/A
☆31Updated 2 months ago
Alternatives and similar repositories for document-qa
Users who are interested in document-qa are comparing it to the libraries listed below.
- Viewer for the structure extracted by Grobid on PDF documents☆55Updated 2 weeks ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 8 months ago
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆81Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆219Updated 9 months ago
- Universal text classifier for generative models☆25Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆63Updated last year
- ☆67Updated last year
- Examples using the Deep Search functionalities☆85Updated 9 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆62Updated last year
- A Python library to chunk/group your texts based on semantic similarity.☆99Updated last year
- ☆106Updated 3 weeks ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Parsee's PDF reader, specialized in the extraction of tables with numeric values and the accurate extraction and preservation of text-par…☆64Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆48Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- multimodal document analysis☆166Updated last week
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 3 months ago
- A basic tool that extracts the structure from the PDF files of scientific articles.☆76Updated 3 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Generalist and Lightweight Model for Text Classification☆164Updated 5 months ago
- Evaluation framework for document processing models and services.☆55Updated last week
- ☆55Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆246Updated 5 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆191Updated 5 months ago
- GLiNER model in a FastAPI microservice.☆45Updated 11 months ago
- Small Python package to measure OCR quality and other related metrics.☆25Updated last year
- Fact checking baseline combining dense retrieval and textual entailment☆30Updated 3 months ago