lfoppiano / document-qaLinks
Scientific Document Insight Q/A
☆31Updated last month
Alternatives and similar repositories for document-qa
Users that are interested in document-qa are comparing it to the libraries listed below
Sorting:
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 6 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆54Updated 3 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆66Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆214Updated 8 months ago
- ☆67Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆189Updated 4 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆70Updated 9 months ago
- Examples using the Deep Search functionalities☆84Updated 8 months ago
- ☆28Updated last year
- Universal text classifier for generative models☆25Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆63Updated last year
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-par…☆63Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Generalist and Lightweight Model for Text Classification☆163Updated 3 months ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆70Updated 2 years ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆65Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- A curated list of materials on AI guardails☆40Updated 4 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- A spaCy wrapper for GliNER☆121Updated 8 months ago
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆95Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆119Updated 2 weeks ago
- ☆55Updated last year
- GLiNER model in a FastAPI microservice.☆45Updated 9 months ago
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago