lfoppiano / document-qa
Scientific Document Insight Q/A
☆29Updated this week
Alternatives and similar repositories for document-qa
Users who are interested in document-qa are comparing it to the libraries listed below.
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆60Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 7 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆52Updated last month
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- Fact checking baseline combining dense retrieval and textual entailment☆29Updated 5 months ago
- The code for LexDrafter framework: a framework that assists in drafting Definitions articles for legislative documents using retrieval au…☆11Updated last month
- ☆67Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆64Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆25Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆32Updated 2 months ago
- Efficient few-shot learning with cross-encoders.☆53Updated last year
- AI assistant, based on the GPT-3.5 model by OpenAI, designed to enhance your proficiency in writing research papers. Allows you to adapt …☆28Updated 7 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆52Updated 8 months ago
- Generalist and Lightweight Model for Text Classification☆134Updated last week
- Split and analyze text files using langchain and streamlit☆48Updated last year
- PDF parser powered by grobid☆28Updated 11 months ago
- Small python package to measure OCR quality and other related metrics.☆23Updated last year
- Repository for deepdoctection tutorial notebooks☆45Updated last week
- ☆23Updated last year
- Universal text classifier for generative models☆24Updated 11 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆59Updated 10 months ago
- Python library to use Pleias-RAG models☆57Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆74Updated 10 months ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Updated 4 years ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆58Updated last month
- A basic tool that extracts the structure from the PDF files of scientific articles.☆74Updated 3 years ago
- GLiNER model in a FastAPI microservice.☆44Updated 6 months ago