zeno-ml / zeno-hubLinks
AI Evaluation Platform
☆46Updated 5 months ago
Alternatives and similar repositories for zeno-hub
Users that are interested in zeno-hub are comparing it to the libraries listed below
Sorting:
- Chat Markup Language conversation library☆55Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated 11 months ago
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Sphynx Hallucination Induction☆53Updated 9 months ago
- An attribution library for LLMs☆46Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆68Updated this week
- AI Data Management & Evaluation Platform☆216Updated 2 years ago
- ☆31Updated last year
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆211Updated last week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆31Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 2 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- Synthetic Data for LLM Fine-Tuning☆119Updated last year
- Python library to use Pleias-RAG models☆66Updated 6 months ago
- Open Implementations of LLM Analyses☆107Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated last year
- A framework for evaluating function calls made by LLMs☆40Updated last year
- ☆45Updated 2 years ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Verbosity control for AI agents☆64Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last week
- ☆51Updated 9 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆125Updated last week
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆63Updated 11 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆80Updated 9 months ago
- ☆82Updated this week