lfoppiano / document-qa
Scientific Document Insight Q/A
☆23Updated this week
Related projects ⓘ
Alternatives and complementary repositories for document-qa
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆146Updated this week
- Examples using the Deep Search functionalities☆47Updated 3 months ago
- Code to extract Knowledge Graph from normal, unstructured text and visualize the resulting graph☆50Updated 7 months ago
- Streamlit Annotation Tools is a Streamlit component that gives you access to various annotation tools (labeling, highlighting, etc.) for …☆79Updated 10 months ago
- End-to-end zero-shot entity and relation extraction☆58Updated 3 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆100Updated 9 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆50Updated 8 months ago
- Viewer for the structure extracted by Grobid on PDF documents☆40Updated 2 weeks ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆21Updated last month
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆44Updated 2 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆46Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆134Updated 2 months ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆11Updated 3 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆23Updated 3 months ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆72Updated last year
- Chunk your text using gpt4o-mini more accurately☆42Updated 3 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆106Updated 7 months ago
- ☆105Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- ☆75Updated 5 months ago
- A spaCy wrapper for GliNER☆91Updated 4 months ago
- ☆53Updated 10 months ago
- 📚 Process PDFs, Word documents and more with spaCy☆116Updated this week
- A chatbot made using the Chatterbot library in Python and locally hosted using Streamlit. Dataset used were collected during ConvAI2 comp…☆15Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆26Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- Efficient few-shot learning with cross-encoders.☆40Updated 9 months ago