stephenleo / llm-structured-output-benchmarksLinks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition, synthetic data generation, etc.
☆179Updated last year
Alternatives and similar repositories for llm-structured-output-benchmarks
Users that are interested in llm-structured-output-benchmarks are comparing it to the libraries listed below
Sorting:
- ☆146Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆114Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆120Updated this week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆146Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆336Updated 4 months ago
- ☆237Updated 4 months ago
- Generalist and Lightweight Model for Text Classification☆163Updated 4 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆442Updated last year
- A Lightweight Library for AI Observability☆251Updated 8 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆329Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- FastAPI wrapper around DSPy☆277Updated last year
- Simple UI for debugging correlations of text embeddings☆296Updated 5 months ago
- A small library of LLM judges☆296Updated 2 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆117Updated 6 months ago
- ☆124Updated 8 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆244Updated 2 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 3 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆400Updated this week
- A Python library to chunk/group your texts based on semantic similarity.☆97Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆72Updated 10 months ago
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆151Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆110Updated 6 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated last month
- awesome synthetic (text) datasets☆302Updated 3 months ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j.☆194Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 11 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year