lfoppiano / document-qa
Scientific Document Insight Q/A
☆29 Updated last month
Alternatives and similar repositories for document-qa:
Users who are interested in document-qa are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆48 Updated last week
- A spaCy wrapper for GliNER ☆108 Updated last month
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆61 Updated 10 months ago
- Fact checking baseline combining dense retrieval and textual entailment ☆28 Updated 2 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDF and QA on… ☆43 Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆68 Updated 7 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆61 Updated 3 months ago
- Repository of the ICNLSP 2024 paper "Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes… ☆13 Updated 2 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆28 Updated 3 months ago
- Viewer for the structure extracted by Grobid on PDF documents ☆47 Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆78 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 4 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆13 Updated 7 months ago
- PhD Dissertation "Automated Extraction and Curation of Materials Information from Scientific Literature" ☆9 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆59 Updated last year
- GLiNER model in a FastAPI microservice. ☆39 Updated 3 months ago
- Pandas-LLM ☆40 Updated last year
- Generalist and Lightweight Model for Text Classification ☆92 Updated this week
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance" ☆19 Updated last month
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆103 Updated last year
- Knowledge Graph Generator app ☆30 Updated 11 months ago
- Universal text classifier for generative models ☆22 Updated 8 months ago
- Examples using the Deep Search functionalities ☆69 Updated last month
- An easy way to chunk spaCy docs. ☆19 Updated 7 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆46 Updated 6 months ago
- PDF parser powered by grobid ☆25 Updated 7 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆23 Updated 2 years ago