citadel-ai / langcheck
Simple, Pythonic building blocks to evaluate LLM applications.
☆232 Updated 2 weeks ago
Alternatives and similar repositories for langcheck
Users who are interested in langcheck are comparing it to the libraries listed below.
- Library to ask OpenAI GPT for generating objects on the Python runtime. ☆195 Updated 2 years ago
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files. ☆29 Updated last month
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆151 Updated 10 months ago
- ☆61 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆173 Updated 10 months ago
- ☆314 Updated 9 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral ☆17 Updated 6 months ago
- ☆26 Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs ☆278 Updated 2 weeks ago
- A Lightweight Library for AI Observability ☆250 Updated 5 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack ☆160 Updated this week
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models ☆74 Updated 4 months ago
- ☆173 Updated last year
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization) ☆716 Updated last month
- ☆137 Updated this week
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk ☆302 Updated last week
- ☆185 Updated last year
- Simple UI for debugging correlations of text embeddings ☆288 Updated 2 months ago
- ☆195 Updated last year
- ☆145 Updated last year
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆260 Updated last month
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge… ☆165 Updated last year
- Generalist and Lightweight Model for Text Classification ☆148 Updated last month
- ☆49 Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆434 Updated last year
- A small library of LLM judges ☆248 Updated last week
- Build MLOps Pipelines in Minutes ☆247 Updated this week
- ☆77 Updated 8 months ago
- A tool for evaluating LLMs ☆424 Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆111 Updated 4 months ago