langwatch / langevals

LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails so you can protect and benchmark your LLMs and pipelines.

☆63

Alternatives and similar repositories for langevals

Users who are interested in langevals are comparing it to the libraries listed below.

CYQIQ / MultiCoT
Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph
☆147 Updated last year
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆76 Updated 9 months ago
ammirsm / llamaindex-omakase-rag
This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…
☆146 Updated last year
anyscale / llm-router
Tutorial for building LLM router
☆220 Updated last year
diicellman / dspy-rag-fastapi
FastAPI wrapper around DSPy
☆258 Updated last year
weaviate / gorilla
Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.
☆133 Updated last month
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆181 Updated 10 months ago
BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆150 Updated 6 months ago
seanchatmangpt / dspygen
A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and Llama.
☆126 Updated 9 months ago
agentsea / surfkit
A toolkit for building computer use AI agents
☆170 Updated last month
bradAGI / GraphMemory
GraphRAG database - hybrid graph / vector db
☆127 Updated 10 months ago
darshil3011 / AutoMetaRAG
Dynamic Metadata based RAG Framework
☆75 Updated last year
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59 Updated last year
whyhow-ai / whyhow
Automated knowledge graph creation SDK
☆122 Updated 8 months ago
topoteretes / awesome-ai-memory
A list of AI memory projects
☆185 Updated 6 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53 Updated 8 months ago
jmanhype / dspy-self-discover-framework
Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…
☆67 Updated last year
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆83 Updated 4 months ago
RasaHQ / calm-langgraph-customer-service-comparison
A reimplementation of LangGraph's customer support example in Rasa's CALM paradigm and a quantitative evaluation of the two approaches
☆80 Updated 4 months ago
qdrant / qdrant-rag-eval
This repo is the central repo for all the RAG Evaluation reference material and partner workshop
☆73 Updated 3 months ago
Barneyjm / langchain-autotools
Generate Tools and Toolkits from any Python SDK -- no extra code required
☆53 Updated 9 months ago
whyhow-ai / rule-based-retrieval
The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…
☆246 Updated 10 months ago
hwchase17 / langfuzz
☆71 Updated 9 months ago
marcusschiesser / open-swarm
Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.
☆92 Updated 9 months ago
garg-ankush / scipe
SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.
☆25 Updated 9 months ago
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆87 Updated 10 months ago
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78 Updated 5 months ago
datastax / ai-chatbot-starter
A starter app to build AI powered chat bots with Astra DB and LlamaIndex
☆74 Updated last year
hammer-mt / DSPyUI
A user interface for DSPy
☆166 Updated 2 months ago
run-llama / llamaindex_aws_ingestion
☆89 Updated last year