citadel-ai / langcheck
Simple, Pythonic building blocks to evaluate LLM applications.
☆221 Updated last week
Alternatives and similar repositories for langcheck:
Users who are interested in langcheck are comparing it to the libraries listed below.
- Python SDK for running evaluations on LLM generated responses ☆277 Updated last week
- Deploy Haystack pipelines behind a REST Api. ☆74 Updated last week
- ☆129 Updated 2 weeks ago
- A Lightweight Library for AI Observability ☆241 Updated 2 months ago
- ☆194 Updated 11 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆149 Updated 6 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆163 Updated 7 months ago
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files. ☆29 Updated 2 weeks ago
- ☆161 Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆422 Updated last year
- ☆299 Updated 5 months ago
- Library to ask OpenAI GPT for generating objects on the Python runtime. ☆195 Updated 2 years ago
- ☆67 Updated 5 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk ☆296 Updated this week
- Project for LLM evaluation on Japanese tasks ☆81 Updated 2 months ago
- ☆116 Updated this week
- ☆176 Updated 6 months ago
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization) ☆667 Updated 4 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆114 Updated last week
- ☆257 Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆263 Updated last month
- ⛓️ build cognitive systems, pythonic ☆335 Updated 5 months ago
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge… ☆159 Updated 9 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆102 Updated 3 weeks ago
- ☆52 Updated last year
- A simple Python sandbox for helpful LLM data agents ☆250 Updated 10 months ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models ☆73 Updated last month
- Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python ☆47 Updated last month
- ☆197 Updated last year
- Synthetic Data for LLM Fine-Tuning ☆114 Updated last year