whylabs / langkitLinks
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
☆967Updated last year
Alternatives and similar repositories for langkit
Users that are interested in langkit are comparing it to the libraries listed below
Sorting:
- A tool for evaluating LLMs☆428Updated last year
- LLM Prompt Injection Detector☆1,389Updated last year
- The Security Toolkit for LLM Interactions☆2,314Updated last week
- ☆778Updated 5 months ago
- Automated Evaluation of RAG Systems☆678Updated 8 months ago
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.☆865Updated last year
- ☆507Updated last year
- Open-source tool to visualise your RAG 🔮☆1,199Updated 11 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)☆398Updated 2 years ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆2,979Updated last year
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- ☆982Updated 3 weeks ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk☆309Updated last month
- Evaluation and Tracking for LLM Experiments and AI Agents☆2,973Updated last week
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆711Updated 2 weeks ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆520Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,840Updated last year
- Deliver safe & effective language models☆547Updated last month
- ☆903Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.☆320Updated 5 months ago
- ☆468Updated last year
- 🦜💯 Flex those feathers!☆255Updated last year
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,020Updated 7 months ago
- Adding guardrails to large language models.☆6,134Updated this week
- Python SDK for running evaluations on LLM generated responses☆293Updated 6 months ago
- Automatically evaluate your LLMs in Google Colab☆675Updated last year
- 👩🏻🍳 A collection of example notebooks using Haystack☆515Updated this week
- Promptimize is a prompt engineering evaluation and testing toolkit.☆486Updated last week
- An LLM-powered advanced RAG pipeline built from scratch☆854Updated last year
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t…☆503Updated 10 months ago