gaudiy / langsmith-evaluation-helperLinks
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
☆29Updated 2 months ago
Alternatives and similar repositories for langsmith-evaluation-helper
Users that are interested in langsmith-evaluation-helper are comparing it to the libraries listed below
Sorting:
- Simple, Pythonic building blocks to evaluate LLM applications.☆235Updated last week
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 7 months ago
- ☆39Updated this week
- ☆63Updated last year
- ☆16Updated 7 months ago
- Personal collection of sample code and experiments using the Swarm multi-agent framework.☆10Updated 10 months ago
- ☆13Updated 4 months ago
- ☆27Updated last year
- Benchmark for Japanese document embedding & vector search☆30Updated last year
- Python + Markdown framework for building internal apps.☆106Updated 4 months ago
- Pengunuity is an experimental open-source autonomous AI Agent that aims to replicate the structure of human memory.☆61Updated 2 years ago
- codeinterpreter-api with Streamlit☆39Updated 2 years ago
- Evaluating the performance of LLMs on Japanese challenging financial tasks.☆22Updated 3 weeks ago
- Project of llm evaluation to Japanese tasks☆87Updated this week
- A Slack MCP server☆93Updated this week
- ☆27Updated 9 months ago
- Python library that provides a unified interface for interacting with multiple Large Language Models (LLMs) from different providers.☆19Updated 3 weeks ago
- Japanese LLaMa experiment☆54Updated 8 months ago
- ☆26Updated last year
- Anthropic Dev Container Features, including Claude Code CLI☆125Updated 2 months ago
- ☆112Updated 2 weeks ago
- ☆50Updated last year
- Useful tool to build multi-agent in an easy way☆65Updated 6 months ago
- A Slack bot that lets you chat with 100+ LLMs via LiteLLM.☆26Updated this week
- This Repositry is an experiment with an agent that searches documents and asks questions repeatedly in response to the main question. It …☆18Updated 2 years ago
- Japanese / English Bilingual LLM☆24Updated this week
- ☆274Updated last year
- AWS CDK for Dify☆43Updated 2 weeks ago
- Claude3のマルチモーダル機能を用いてmp4の動画をプロンプトによる解析をします。☆16Updated last year
- Swallowプロジェクト 事後学習ずみ大規模言語モデル 評価フレームワーク☆16Updated this week