lfoppiano / document-qa
Scientific Document Insight Q/A
☆23 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for document-qa
- Generalist and Lightweight Model for Relation Extraction (extract any relationship type from text) ☆93 · Updated this week
- End-to-end zero-shot entity and relation extraction ☆56 · Updated 3 months ago
- GLiNER model in a FastAPI microservice. ☆28 · Updated 2 weeks ago
- This repository contains an easy and intuitive approach to using SetFit in combination with spaCy. ☆72 · Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆47 · Updated 7 months ago
- A spaCy wrapper for GLiNER ☆87 · Updated 3 months ago
- A BERT-based application for reusable text classification at scale ☆37 · Updated last year
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling. ☆52 · Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc. on task… ☆129 · Updated last month
- Pandas-LLM ☆31 · Updated last year
- A Python library aimed at dissecting and augmenting NER training data. ☆56 · Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆43 · Updated 3 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆18 · Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆40 · Updated 8 months ago
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for … ☆77 · Updated 10 months ago
- Let's build better datasets, together! ☆202 · Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 · Updated 3 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks ☆104 · Updated 7 months ago
- ☆28 · Updated 8 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆23 · Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆32 · Updated 8 months ago
- Examples using the Deep Search functionalities ☆42 · Updated 3 months ago
- Generalist and Lightweight Model for Text Classification ☆48 · Updated 2 months ago
- Viewer for the structure extracted by Grobid on PDF documents ☆38 · Updated this week
- Efficient few-shot learning with cross-encoders. ☆40 · Updated 8 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ☆80 · Updated 9 months ago
- An LLM training library for instruction-tuning. ☆23 · Updated 8 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆42 · Updated 2 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆97 · Updated 6 months ago
- ☆53 · Updated 10 months ago