lfoppiano / document-qaLinks
Scientific Document Insight Q/A
☆33Updated 4 months ago
Alternatives and similar repositories for document-qa
Users that are interested in document-qa are comparing it to the libraries listed below
Sorting:
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆53Updated 9 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆57Updated 2 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62Updated last year
- ☆67Updated last year
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆221Updated 11 months ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Updated 4 years ago
- A spaCy wrapper for GliNER☆128Updated 11 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆83Updated last year
- ☆28Updated last year
- Knowledge Graph Generator app☆34Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 4 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆71Updated 2 years ago
- Universal text classifier for generative models☆24Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆197Updated 7 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆73Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆49Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- multimodal document analysis☆166Updated last month
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆125Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆72Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Generalist and Lightweight Model for Text Classification☆166Updated last month
- Synthetic Text Dataset Generation for LLM projects☆55Updated last month
- PDF parser powered by grobid☆27Updated last year
- Pandas-LLM☆46Updated 2 years ago
- GLiNER model in a FastAPI microservice.☆47Updated last year