gaudiy / langsmith-evaluation-helper
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
☆27Updated 3 weeks ago
Alternatives and similar repositories for langsmith-evaluation-helper:
Users who are interested in langsmith-evaluation-helper are comparing it to the libraries listed below.
- 🤖 A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.☆52Updated 7 months ago
- ☆44Updated 2 weeks ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 2 weeks ago
- Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness.☆45Updated 2 months ago
- Japanese LLaMa experiment☆52Updated last month
- codeinterpreter-api with Streamlit☆39Updated last year
- ☆26Updated 6 months ago
- Personal collection of sample code and experiments using the Swarm multi-agent framework.☆10Updated 3 months ago
- Benchmark for Japanese document embedding & vector search☆28Updated 10 months ago
- ☆14Updated 4 months ago
- ☆49Updated 9 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- ☆41Updated last year
- MCP Server implementation for Claude☆19Updated last month
- Simple, Pythonic building blocks to evaluate LLM applications.☆204Updated last month
- ☆22Updated last year
- ☆41Updated 11 months ago
- SDTT: a simple and effective distillation method for discrete diffusion models☆16Updated last week
- ☆56Updated 7 months ago
- Analyzes mp4 videos via prompts using Claude3's multimodal capabilities.☆15Updated 10 months ago
- ☆15Updated 10 months ago
- Terraform template for Dify on AWS☆35Updated 4 months ago
- Project of llm evaluation to Japanese tasks☆79Updated last month
- ☆38Updated 5 months ago
- ☆15Updated 3 weeks ago
- Evolutionary Merge Experiment☆32Updated 7 months ago
- A simple coding assistant app powered by Microsoft guidance, OpenAI LLMs, and Gradio.☆34Updated last year
- AI pipeline to generate Talk-to-the-City reports☆79Updated 5 months ago
- ☆13Updated this week
- ☆26Updated 8 months ago