gaudiy / langsmith-evaluation-helper
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
β28Updated 2 months ago
Alternatives and similar repositories for langsmith-evaluation-helper:
Users that are interested in langsmith-evaluation-helper are comparing it to the libraries listed below
- π€ A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.β55Updated 9 months ago
- β49Updated last year
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistralβ17Updated 2 months ago
- Benchmark for Japanese document embedding & vector searchβ28Updated 11 months ago
- β35Updated last year
- Japanese LLaMa experimentβ52Updated 3 months ago
- β25Updated 4 months ago
- β75Updated last week
- codeinterpreter-api with Streamlitβ39Updated last year
- [ICLR 2025] SDTT: a simple and effective distillation method for discrete diffusion modelsβ17Updated last month
- A collection of AI Agents papers (Updated biweekly)β42Updated this week
- β49Updated 11 months ago
- Terraform template for Dify on AWSβ36Updated 6 months ago
- Model Context Protocol Server for Safely Executing Pre-approved Commandsβ15Updated 2 months ago
- Pengunuity is an experimental open-source autonomous AI Agent that aims to replicate the structure of human memory.β60Updated last year
- Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness.β70Updated this week
- Python + Markdown framework for building internal apps.β90Updated this week
- Kannon is a wrapper for the gokart library that allows gokart tasks to be easily executed in a distributed and parallel manner on multiplβ¦β25Updated last month
- Simple, Pythonic building blocks to evaluate LLM applications.β212Updated 3 weeks ago
- β47Updated 2 months ago
- β9Updated last year
- β22Updated last year
- Useful tool to build multi-agent in an easy wayβ65Updated 3 weeks ago
- Personal collection of sample code and experiments using the Swarm multi-agent framework.β10Updated 4 months ago
- β26Updated 8 months ago
- β14Updated 3 months ago
- β16Updated 2 months ago
- β14Updated 6 months ago