gaudiy / langsmith-evaluation-helperLinks
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
☆29Updated last month
Alternatives and similar repositories for langsmith-evaluation-helper
Users that are interested in langsmith-evaluation-helper are comparing it to the libraries listed below
Sorting:
- Simple, Pythonic building blocks to evaluate LLM applications.☆232Updated 2 weeks ago
- ☆61Updated last year
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 6 months ago
- codeinterpreter-api with Streamlit☆39Updated 2 years ago
- ☆26Updated 9 months ago
- Python + Markdown framework for building internal apps.☆104Updated 3 months ago
- ☆27Updated last year
- Benchmark for Japanese document embedding & vector search☆29Updated last year
- ☆49Updated last year
- Evaluating the performance of LLMs on Japanese challenging financial tasks.☆21Updated last week
- Japanese LLaMa experiment☆53Updated 7 months ago
- ☆99Updated 2 months ago
- ☆33Updated this week
- ☆13Updated 4 months ago
- Personal collection of sample code and experiments using the Swarm multi-agent framework.☆10Updated 9 months ago
- ☆16Updated 6 months ago
- ☆26Updated last year
- ☆16Updated 7 months ago
- Project of llm evaluation to Japanese tasks☆86Updated this week
- ☆164Updated 4 months ago
- Useful tool to build multi-agent in an easy way☆65Updated 5 months ago
- ☆275Updated last year
- Anthropic Dev Container Features, including Claude Code CLI☆105Updated last month
- A command-line tool that uses Gemini API to generate summaries of academic papers.☆45Updated 2 months ago
- ☆24Updated last year
- Python library that provides a unified interface for interacting with multiple Large Language Models (LLMs) from different providers.☆19Updated this week
- A Slack MCP server☆81Updated 3 weeks ago
- Pengunuity is an experimental open-source autonomous AI Agent that aims to replicate the structure of human memory.☆61Updated 2 years ago
- Japanese / English Bilingual LLM☆24Updated last week
- A powerful MCP Server that enables AI assistants like Claude to interact with humans through intuitive GUI dialogs. This server bridges t…☆62Updated last month