whylabs / langkitLinks

🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀

☆930

Alternatives and similar repositories for langkit

Users that are interested in langkit are comparing it to the libraries listed below

Sorting:

protectai / rebuff
LLM Prompt Injection Detector
☆1,318Updated 11 months ago
arthur-ai / bench
A tool for evaluating LLMs
☆423Updated last year
protectai / llm-guard
The Security Toolkit for LLM Interactions
☆1,889Updated last week
langchain-ai / auto-evaluator
☆772Updated last month
run-llama / finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
☆504Updated last year
dgarnitz / vectorflow
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…
☆697Updated last year
tigerlab-ai / tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
☆398Updated last year
MagnivOrg / prompt-layer-library
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
☆649Updated last week
gabrielchua / RAGxplorer
Open-source tool to visualise your RAG 🔮
☆1,146Updated 6 months ago
finic-ai / doctran
☆507Updated 11 months ago
langchain-ai / langsmith-cookbook
☆939Updated last week
georgian-io / LLM-Finetuning-Toolkit
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
☆850Updated 9 months ago
fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆184Updated last year
stanford-futuredata / ARES
Automated Evaluation of RAG Systems
☆633Updated 4 months ago
truera / trulens
Evaluation and Tracking for LLM Experiments and AI Agents
☆2,675Updated this week
pinecone-io / canopy
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
☆1,019Updated 8 months ago
Raudaschl / rag-fusion
☆885Updated 9 months ago
TonicAI / tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
☆314Updated 3 weeks ago
lunary-ai / lunary
The production toolkit for LLMs. Observability, prompt management and evaluations.
☆1,373Updated this week
hegelai / prompttools
Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…
☆2,910Updated 11 months ago
pchunduri6 / rag-demystified
An LLM-powered advanced RAG pipeline built from scratch
☆845Updated last year
JohnSnowLabs / langtest
Deliver safe & effective language models
☆529Updated this week
wandb / wandbot
wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk
☆302Updated this week
deepset-ai / haystack-cookbook
👩🏻‍🍳 A collection of example notebooks using Haystack
☆487Updated this week
Arize-ai / openinference
OpenTelemetry Instrumentation for AI Observability
☆521Updated this week
philschmid / easyllm
☆461Updated last year
rajshah4 / LLM-Evaluation
Sample notebooks and prompts for LLM evaluation
☆136Updated last month
cohere-ai / notebooks
Code examples and jupyter notebooks for the Cohere Platform
☆504Updated 6 months ago
viddexa / autollm
Ship RAG based LLM web apps in seconds.
☆997Updated last year
langchain-ai / langchain-benchmarks
🦜💯 Flex those feathers!
☆252Updated 9 months ago