whylabs / langkit
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring safety & security. 🛡️ Features include text quality, relevance metrics, & sentiment analysis. 📊 A comprehensive tool for LLM observability. 👀
☆952 Updated 10 months ago
Alternatives and similar repositories for langkit
Users who are interested in langkit are comparing it to the libraries listed below.
- A tool for evaluating LLMs ☆424 Updated last year
- LLM Prompt Injection Detector ☆1,358 Updated last year
- Evaluation and Tracking for LLM Experiments and AI Agents ☆2,811 Updated this week
- Fine-Tuning Embedding for RAG with Synthetic Data ☆511 Updated 2 years ago
- The Security Toolkit for LLM Interactions ☆2,101 Updated 2 weeks ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆399 Updated last year
- Deliver safe & effective language models ☆541 Updated this week
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. ☆854 Updated 11 months ago
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions. ☆671 Updated 2 weeks ago
- ☆507 Updated last year
- ☆775 Updated 3 months ago
- Open-source tool to visualise your RAG 🔮 ☆1,158 Updated 8 months ago
- ☆895 Updated 11 months ago
- ☆966 Updated 2 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆318 Updated 2 months ago
- ☆462 Updated last year
- Fiddler Auditor is a tool to evaluate language models. ☆188 Updated last year
- Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone ☆1,022 Updated 10 months ago
- Automated Evaluation of RAG Systems ☆658 Updated 6 months ago
- 🦜💯 Flex those feathers! ☆252 Updated 11 months ago
- wandbot is a technical support bot for Weights & Biases' AI developer tools that can run in Discord, Slack, ChatGPT and Zendesk ☆307 Updated last month
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,835 Updated last year
- Python SDK for running evaluations on LLM generated responses ☆292 Updated 3 months ago
- The production toolkit for LLMs. Observability, prompt management and evaluations. ☆1,415 Updated last week
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t… ☆484 Updated 7 months ago
- OpenTelemetry Instrumentation for AI Observability ☆608 Updated this week
- Document Q&A over The Full Stack's Corpus ☆359 Updated last year
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more ☆623 Updated 4 months ago
- Ship RAG based LLM web apps in seconds. ☆999 Updated last year
- Sample notebooks and prompts for LLM evaluation ☆138 Updated 3 months ago