stephenleo / llm-structured-output-benchmarksLinks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition, synthetic data generation, etc.
☆179Updated last year
Alternatives and similar repositories for llm-structured-output-benchmarks
Users that are interested in llm-structured-output-benchmarks are comparing it to the libraries listed below
Sorting:
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆113Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆119Updated last week
- ☆237Updated 3 months ago
- ☆146Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆440Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- A Lightweight Library for AI Observability☆251Updated 7 months ago
- Generalist and Lightweight Model for Text Classification☆162Updated 3 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆147Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- ☆210Updated 3 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆336Updated 4 months ago
- FastAPI wrapper around DSPy☆274Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated 11 months ago
- DSPY on action with OpenSource LLMs.☆96Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆116Updated 6 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆329Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 11 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆191Updated last year
- Synthetic Data for LLM Fine-Tuning☆120Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆243Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 5 months ago
- ☆124Updated 7 months ago
- Function Calling Benchmark & Testing☆90Updated last year
- ☆119Updated last year
- A Python library to chunk/group your texts based on semantic similarity.☆96Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆49Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆90Updated 3 weeks ago
- Simple UI for debugging correlations of text embeddings☆292Updated 4 months ago