citadel-ai / langcheck
Simple, Pythonic building blocks to evaluate LLM applications.
☆246Updated last month
Alternatives and similar repositories for langcheck
Users who are interested in langcheck are comparing it to the libraries listed below.
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.☆31Updated 6 months ago
- ☆69Updated 2 years ago
- Library to ask OpenAI GPT for generating objects on the Python runtime.☆195Updated 2 years ago
- ☆145Updated 2 weeks ago
- ☆50Updated last year
- LLM evaluation project for Japanese tasks☆91Updated this week
- A small library of LLM judges☆311Updated 5 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆156Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆181Updated last year
- ☆336Updated last year
- ☆184Updated 2 years ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆293Updated 2 months ago
- Unofficial entropix implementation for Gemma2, Llama, Qwen2, and Mistral☆17Updated 11 months ago
- A Lightweight Library for AI Observability☆252Updated 10 months ago
- Benchmark for Japanese document embedding & vector search☆29Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆446Updated last year
- Build MLOps Pipelines in Minutes☆252Updated 4 months ago
- Python + Markdown framework for building internal apps.☆109Updated 8 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆309Updated 2 months ago
- Evaluating the performance of LLMs on challenging Japanese financial tasks.☆28Updated 5 months ago
- ☆148Updated last year
- JQaRA: Japanese Question Answering with Retrieval Augmentation - a Japanese Q&A dataset for evaluating retrieval-augmented generation (RAG)☆41Updated 3 months ago
- A tool for evaluating LLMs☆428Updated last year
- Additional packages (components, document stores, and the like) to extend the capabilities of Haystack☆177Updated this week
- ☆93Updated 2 years ago
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆76Updated 9 months ago
- ☆274Updated last year
- ☆62Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker☆125Updated 2 months ago