lfoppiano / document-qaLinks
Scientific Document Insight Q/A
☆30Updated 2 months ago
Alternatives and similar repositories for document-qa
Users that are interested in document-qa are comparing it to the libraries listed below
Sorting:
- Viewer for the structure extracted by Grobid on PDF documents☆53Updated 3 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 5 months ago
- ☆67Updated last year
- Examples using the Deep Search functionalities☆85Updated 7 months ago
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆69Updated last year
- Efficient few-shot learning with cross-encoders.☆56Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆78Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆213Updated 7 months ago
- multimodal document analysis☆164Updated last year
- Universal text classifier for generative models☆24Updated last year
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago
- GLiNER model in a FastAPI microservice.☆45Updated 8 months ago
- ☆98Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- A curated list of materials on AI guardails☆40Updated 2 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110Updated last year
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆94Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated 11 months ago
- A spaCy wrapper for GliNER☆119Updated 7 months ago
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 8 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆185Updated 3 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Robust and fast topic models with sentence-transformers.☆78Updated last month
- ☆27Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆69Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year