citadel-ai / langcheckLinks
Simple, Pythonic building blocks to evaluate LLM applications.
☆230Updated this week
Alternatives and similar repositories for langcheck
Users that are interested in langcheck are comparing it to the libraries listed below
Sorting:
- Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.☆29Updated 3 weeks ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆151Updated 9 months ago
- Library to ask OpenAI GPT for generating objects on the Python runtime.☆195Updated 2 years ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 6 months ago
- ☆61Updated last year
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆271Updated last month
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆158Updated this week
- Project of llm evaluation to Japanese tasks☆84Updated this week
- Build MLOps Pipelines in Minutes☆245Updated this week
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆302Updated last week
- ☆136Updated last week
- A small library of LLM judges☆228Updated 2 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- ☆168Updated last year
- ☆99Updated 2 months ago
- ☆49Updated last year
- A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)☆694Updated 3 weeks ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆74Updated 4 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆311Updated last month
- A tool for evaluating LLMs☆424Updated last year
- Fiddler Auditor is a tool to evaluate language models.☆183Updated last year
- ☆264Updated last year
- ☆310Updated 8 months ago
- 🦜💯 Flex those feathers!☆251Updated 8 months ago
- chat to visualization with LLM☆254Updated last year
- Tutorial for building LLM router☆216Updated 11 months ago
- An OpenAI Completions API compatible server for NLP transformers models☆65Updated last year
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆122Updated this week
- ☆92Updated last year