stephenleo / llm-structured-output-benchmarks
Benchmark various LLM structured output frameworks (Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc.) on tasks such as multi-label classification, named entity recognition, and synthetic data generation.
☆117Updated 3 weeks ago
Related projects:
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆143Updated 5 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆91Updated 5 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆101Updated last week
- ☆126Updated 2 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents using an Elo ranker☆101Updated last week
- awesome synthetic (text) datasets☆213Updated this week
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆77Updated 7 months ago
- FastAPI wrapper around DSPy☆201Updated 6 months ago
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query.☆161Updated 4 months ago
- ☆82Updated 3 weeks ago
- Let's build better datasets, together!☆195Updated last month
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆147Updated 5 months ago
- ☆70Updated 3 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆117Updated this week
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆236Updated last week
- This repository implements the Chain-of-Verification paper by Meta AI☆151Updated 11 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆99Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 2 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆115Updated 3 months ago
- DSPy in action with open-source LLMs.☆49Updated 5 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆124Updated this week
- ☆75Updated 3 weeks ago
- StructuredRAG Benchmarker☆85Updated this week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 2 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated 7 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆80Updated 3 weeks ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆119Updated 8 months ago
- A simple Python sandbox for helpful LLM data agents☆143Updated 3 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆362Updated 7 months ago
- Testing the speed and accuracy of RAG with and without a cross-encoder reranker.☆45Updated 8 months ago