lfoppiano / document-qa
Scientific Document Insight Q/A
☆29 Updated 3 weeks ago
Alternatives and similar repositories for document-qa:
Users who are interested in document-qa are comparing it to the libraries listed below.
- Viewer for the structure extracted by Grobid on PDF documents ☆48 Updated 2 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆50 Updated last month
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction ☆70 Updated 8 months ago
- PDF parser powered by grobid ☆26 Updated 8 months ago
- PhD Dissertation "Automated Extraction and Curation of Materials Information from Scientific Literature" ☆9 Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆79 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆103 Updated last year
- Generalist and Lightweight Model for Text Classification ☆121 Updated 2 weeks ago
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 Updated 11 months ago
- Repository for deepdoctection tutorial notebooks ☆44 Updated 4 months ago
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for … ☆88 Updated last year
- Efficient few-shot learning with cross-encoders. ☆51 Updated last year
- Fact checking baseline combining dense retrieval and textual entailment ☆28 Updated 3 months ago
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph ☆57 Updated last year
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆29 Updated 2 weeks ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆64 Updated 4 months ago
- GLiNER model in a FastAPI microservice. ☆41 Updated 4 months ago
- A BERT-based application for reusable text classification at scale ☆38 Updated last year
- A spaCy wrapper for GliNER ☆112 Updated 2 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling. ☆57 Updated 8 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an… ☆28 Updated 7 months ago
- An easy way to chunk spaCy docs. ☆19 Updated 8 months ago
- Examples using the Deep Search functionalities ☆71 Updated 2 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on… ☆44 Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated 11 months ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.