gaudiy / langsmith-evaluation-helper
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
☆29Updated last month
Alternatives and similar repositories for langsmith-evaluation-helper
Users who are interested in langsmith-evaluation-helper are comparing it to the libraries listed below.
- Simple, Pythonic building blocks to evaluate LLM applications.☆230Updated last week
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Updated 6 months ago
- ☆61Updated last year
- ☆99Updated 2 months ago
- ☆49Updated last year
- ☆32Updated this week
- ☆27Updated last year
- ☆43Updated 5 months ago
- codeinterpreter-api with Streamlit☆39Updated last year
- Benchmark for Japanese document embedding & vector search☆29Updated last year
- Python + Markdown framework for building internal apps.☆104Updated 2 months ago
- ☆26Updated last year
- ☆15Updated 5 months ago
- Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"☆110Updated 5 months ago
- Repository for MCP screenshot functionality☆16Updated 6 months ago
- Evaluating the performance of LLMs on Japanese challenging financial tasks.☆21Updated last month
- Japanese LLaMa experiment☆53Updated 7 months ago
- AI pipeline to generate Talk-to-the-City reports☆127Updated 5 months ago
- Python library that provides a unified interface for interacting with multiple Large Language Models (LLMs) from different providers.☆18Updated last week
- Japanese / English Bilingual LLM☆23Updated 3 weeks ago
- ☆160Updated 3 months ago
- This repository is an experiment with an agent that searches documents and asks questions repeatedly in response to the main question. It …☆18Updated 2 years ago
- Scripts and related tools for converting Japanese Wikipedia sentences into various Japanese embeddings and faiss indexes.☆11Updated last year
- ☆26Updated 8 months ago
- Useful tool to build multi-agent in an easy way☆65Updated 4 months ago
- ☆35Updated 2 years ago
- Pengunuity is an experimental open-source autonomous AI Agent that aims to replicate the structure of human memory.☆62Updated 2 years ago
- Langtrace SDK for Python Applications☆36Updated last month
- ☆13Updated 3 months ago
- Terraform template for Dify on AWS☆40Updated 10 months ago