citadel-ai / langcheckLinks

Simple, Pythonic building blocks to evaluate LLM applications.

☆232

Alternatives and similar repositories for langcheck

Users that are interested in langcheck are comparing it to the libraries listed below

Sorting:

odashi / davinci-functions
Library to ask OpenAI GPT for generating objects on the Python runtime.
☆195Updated 2 years ago
gaudiy / langsmith-evaluation-helper
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
☆29Updated last month
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 10 months ago
fladdict / llmermaid
☆61Updated last year
stephenleo / llm-structured-output-benchmarks
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…
☆173Updated 10 months ago
anthropics / anthropic-tools
☆314Updated 9 months ago
neodyland / entropix
Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral
☆17Updated 6 months ago
naotaka1128 / ai_app_book
☆26Updated last year
stanford-oval / suql
SUQL: Conversational Search over Structured and Unstructured Data with LLMs
☆278Updated 2 weeks ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆250Updated 5 months ago
deepset-ai / haystack-core-integrations
Additional packages (components, document stores and the likes) to extend the capabilities of Haystack
☆160Updated this week
cohere-ai / cohere-finetune
A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models
☆74Updated 4 months ago
deepset-ai / prompthub
☆173Updated last year
microsoft / sammo
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)
☆716Updated last month
llm-jp / llm-jp-eval
☆137Updated this week
wandb / wandbot
wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk
☆302Updated last week
run-llama / ai-engineer-workshop
☆185Updated last year
jina-ai / correlations
Simple UI for debugging correlations of text embeddings
☆288Updated 2 months ago
jxnl / n-levels-of-rag
☆195Updated last year
apple / ml-superposition-prompting
☆145Updated last year
567-labs / kura
Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…
☆260Updated last month
anthropics / anthropic-retrieval-demo
Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge…
☆165Updated last year
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆148Updated last month
yuzu-ai / japanese-llm-ranking
☆49Updated last year
KarelDO / xmc.dspy
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆434Updated last year
quotient-ai / judges
A small library of LLM judges
☆248Updated last week
GoogleCloudPlatform / automlops
Build MLOps Pipelines in Minutes
☆247Updated this week
eugeneyan / align-app
☆77Updated 8 months ago
arthur-ai / bench
A tool for evaluating LLMs
☆424Updated last year
PrithivirajDamodaran / Route0x
Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da
☆111Updated 4 months ago