lfoppiano / document-qa
Scientific Document Insight Q/A
☆29Updated last month
Alternatives and similar repositories for document-qa
Users who are interested in document-qa compare it to the libraries listed below.
- Viewer for the structure extracted by Grobid on PDF documents☆52Updated 3 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 4 months ago
- Fact checking baseline combining dense retrieval and textual entailment☆30Updated 6 months ago
- Efficient few-shot learning with cross-encoders.☆56Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆59Updated last year
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆67Updated 6 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆69Updated 7 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆76Updated last year
- Generalist and Lightweight Model for Text Classification☆148Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- multimodal document analysis☆165Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated last year
- A spaCy wrapper for GliNER☆118Updated 6 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆133Updated 7 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆62Updated 2 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- ☆67Updated last year
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆92Updated last year
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆69Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 11 months ago
- Parsee's PDF reader, specialized on the extraction of tables with numeric values and the accurate extraction and preservation of text-par…☆60Updated last month
- GLiNER model in a FastAPI microservice.☆45Updated 7 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 9 months ago
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆183Updated 2 months ago
- AI assistant, based on the GPT-3.5 model by OpenAI, designed to enhance your proficiency in writing research papers. Allows you to adapt …☆28Updated 9 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆32Updated 3 months ago
- A Python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Updated 4 years ago