gaudiy / langsmith-evaluation-helper
Helper library for LangSmith that provides an interface to run evaluations by simply writing config files.
β23Updated this week
Related projects β
Alternatives and complementary repositories for langsmith-evaluation-helper
- π€ A collection of AI agents includes research papers, blogs, and products focused on developing autonomous systems.β44Updated 5 months ago
- Japanese LLaMa experimentβ52Updated 8 months ago
- GraphAI is an asynchronous data flow execution engine, which allows developers to build agentic applications by describing agent workflowβ¦β169Updated this week
- Benchmark for Japanese document embedding & vector searchβ28Updated 8 months ago
- Terraform configuration for deploying Dify on Google Cloud with scalability, high availability, and production-level readiness.β30Updated this week
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistralβ15Updated last month
- β24Updated 2 weeks ago
- codeinterpreter-api with Streamlitβ39Updated last year
- β31Updated 2 weeks ago
- γ2024εΉ΄ηγBERTγ«γγγγγΉγει‘β24Updated 4 months ago
- useful tool to use API-based LLM in an easy wayβ59Updated last month
- A Slack Bot for summarizing arXiv papers, powered by OpenAI LLMs.β68Updated last year
- β14Updated 2 months ago
- β13Updated 2 months ago
- β48Updated 7 months ago
- β22Updated 11 months ago
- SDTT: a simple and effective distillation method for discrete diffusion modelsβ16Updated last week
- FAST is an annotation tool that focuses on mobile devices. https://aclanthology.org/2021.emnlp-demo.41/β53Updated 3 years ago
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalitiesβ48Updated 8 months ago
- β37Updated 3 months ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.β13Updated last year
- β51Updated 5 months ago
- Japanese Language Model Financial Evaluation Harnessβ65Updated 3 weeks ago
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentationβ21Updated 6 months ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.β118Updated 3 weeks ago
- AI pipeline to generate Talk-to-the-City reportsβ53Updated 2 months ago
- β17Updated 10 months ago
- Project of llm evaluation to Japanese tasksβ77Updated this week
- Terraform template for Dify on AWSβ30Updated 2 months ago
- β41Updated 9 months ago