lfoppiano / document-qa
Scientific Document Insight Q/A
☆31Updated 2 months ago
Alternatives and similar repositories for document-qa
Users who are interested in document-qa are comparing it to the libraries listed below.
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆54Updated last month
- A handy PDF-to-JSON conversion tool for academic papers implemented in Python.☆71Updated 2 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆63Updated last year
- ☆67Updated last year
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆80Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆68Updated last year
- Efficient few-shot learning with cross-encoders.☆59Updated last year
- PDF parser powered by grobid☆28Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆73Updated 10 months ago
- ☆28Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated last year
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆96Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️☆189Updated 5 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆129Updated last year
- multimodal document analysis☆166Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆62Updated last year
- GLiNER model in a FastAPI microservice.☆45Updated 10 months ago
- Universal text classifier for generative models☆25Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆179Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆48Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 10 months ago
- Examples using the Deep Search functionalities☆85Updated 9 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 2 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDFs and QA on…☆47Updated last year
- Generalist and Lightweight Model for Text Classification☆163Updated 4 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆65Updated last year
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph☆57Updated last year