lfoppiano / document-qaLinks
Scientific Document Insight Q/A
☆29Updated last week
Alternatives and similar repositories for document-qa
Users that are interested in document-qa are comparing it to the libraries listed below
Sorting:
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆73Updated 10 months ago
- The code for LexDrafter framework: a framework that assists in drafting Definitions articles for legislative documents using retrieval au…☆11Updated 3 weeks ago
- Universal text classifier for generative models☆24Updated 10 months ago
- Generalist and Lightweight Model for Text Classification☆128Updated 2 weeks ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆63Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆50Updated 2 months ago
- ☆14Updated last year
- A spaCy wrapper for GliNER☆115Updated 4 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆31Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 9 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆30Updated 8 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆59Updated 10 months ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆26Updated last year
- GLiNER model in a FastAPI microservice.☆44Updated 5 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆60Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆45Updated last year
- Viewer for the structure extracted by Grobid on PDF documents☆51Updated last month
- ☆67Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆212Updated last month
- Fact checking baseline combining dense retrieval and textual entailment☆29Updated 4 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 9 months ago
- Open-Source Synthetic Text Dataset Generation for LLM projects☆27Updated last week
- A RAG that can scale 🧑🏻💻☆11Updated last year