lfoppiano / document-qaLinks
Scientific Document Insight Q/A
☆30Updated 2 weeks ago
Alternatives and similar repositories for document-qa
Users that are interested in document-qa are comparing it to the libraries listed below
Sorting:
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 6 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- ☆67Updated last year
- Viewer for the structure extracted by Grobid on PDF documents☆54Updated this week
- Open Access PDF harvester, metadata aggregator and full-text ingester☆63Updated last year
- Efficient few-shot learning with cross-encoders.☆58Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆214Updated 7 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆61Updated last year
- Universal text classifier for generative models☆24Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆187Updated 3 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆45Updated last year
- Generalist and Lightweight Model for Text Classification☆157Updated 3 months ago
- ☆28Updated last year
- multimodal document analysis☆166Updated last year
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆70Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- ☆102Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 10 months ago
- A Python library to chunk/group your texts based on semantic similarity.☆96Updated last year
- ☆23Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 11 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆63Updated 2 weeks ago
- PDF parser powered by grobid☆28Updated last year
- Examples using the Deep Search functionalities☆85Updated 7 months ago