citadel-ai / langcheck
Simple, Pythonic building blocks to evaluate LLM applications.
β183Updated last week
Related projects: β
- π€ A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.β35Updated 3 months ago
- β254Updated 5 months ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMsβ194Updated 3 weeks ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busineβ¦β136Updated 2 weeks ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.β¦β91Updated 2 months ago
- β134Updated 8 months ago
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)β493Updated this week
- A simple Python sandbox for helpful LLM data agentsβ143Updated 3 months ago
- Library to ask OpenAI GPT for generating objects on the Python runtime.β195Updated last year
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β330Updated this week
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)β72Updated last week
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!β79Updated 7 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β362Updated 7 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β117Updated 3 weeks ago
- Project of llm evaluation to Japanese tasksβ67Updated last week
- Python client library for improving your LLM app accuracyβ94Updated this week
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.β22Updated last month
- Tutorial for building LLM routerβ145Updated 2 months ago
- β56Updated last week
- β85Updated 10 months ago
- Let's build better datasets, together!β195Updated last month
- π¦π― Flex those feathers!β227Updated last month
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledgeβ¦β117Updated 2 months ago
- β58Updated 3 weeks ago
- β24Updated 4 months ago
- β203Updated 2 months ago
- FastAPI wrapper around DSPyβ201Updated 6 months ago
- Approximation of the Claude 3 tokenizer by inspecting generation streamβ109Updated last month
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.β101Updated last week
- β93Updated this week