stephenleo / llm-structured-output-benchmarks
Benchmark various LLM structured output frameworks (Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc.) on tasks like multi-label classification, named entity recognition, and synthetic data generation.
☆173 Updated 9 months ago
Alternatives and similar repositories for llm-structured-output-benchmarks
Users who are interested in llm-structured-output-benchmarks are comparing it to the libraries listed below:
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆107 Updated last year
- ☆227 Updated last month
- ☆145 Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents using an Elo ranker ☆113 Updated last week
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j. ☆186 Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆218 Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- A Lightweight Library for AI Observability ☆246 Updated 5 months ago
- Function Calling Benchmark & Testing ☆88 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆108 Updated 3 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆316 Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆74 Updated 8 months ago
- A Python library to chunk/group your texts based on semantic similarity. ☆97 Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆111 Updated 10 months ago
- ☆76 Updated 6 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆432 Updated last year
- FastAPI wrapper around DSPy ☆253 Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ☆167 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated last year
- This repository implements the chain of verification paper by Meta AI ☆171 Updated last year
- Simple UI for debugging correlations of text embeddings ☆288 Updated last month
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.… ☆353 Updated 4 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. ☆343 Updated last month
- DSPy in action with open-source LLMs. ☆72 Updated last year
- Synthetic Data for LLM Fine-Tuning ☆119 Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆105 Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 8 months ago
- ☆118 Updated 10 months ago
- Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation" ☆115 Updated 7 months ago
- Generalist and Lightweight Model for Text Classification ☆139 Updated last month