langwatch / langevals
LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails so that you can protect and benchmark your LLMs and pipelines.
☆46 Updated this week
Alternatives and similar repositories for langevals:
Users who are interested in langevals are comparing it to the libraries listed below:
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama. ☆122 Updated 5 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 8 months ago
- ☆75 Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆64 Updated 4 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆74 Updated last week
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆80 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆77 Updated last month
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- The next evolution of Agents ☆48 Updated last week
- Routing on Random Forest (RoRF) ☆135 Updated 6 months ago
- ☆92 Updated last year
- auto fine tune of models with synthetic data ☆75 Updated last year
- Reactive DDD with DSPy ☆22 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆83 Updated last week
- Build reliable, secure, and production-ready AI apps easily. ☆68 Updated this week
- ☆118 Updated 3 weeks ago
- AI real estate agent ☆34 Updated last year
- A Python package to dynamically load functions for OpenAI Assistant ☆54 Updated last year
- ☆45 Updated 11 months ago
- Generate Tools and Toolkits from any Python SDK -- no extra code required ☆50 Updated 4 months ago
- ☆57 Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆90 Updated 5 months ago
- Natural Language Interfaces Powered by LLMs ☆90 Updated 7 months ago
- Record and replay LLM interactions for langchain ☆80 Updated 8 months ago
- ☆88 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 Updated 11 months ago
- Testing and evaluation framework for voice agents ☆98 Updated last month
- Simple examples using Argilla tools to build AI ☆53 Updated 4 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon. ☆74 Updated last year