zeno-ml / zeno-hubLinks
AI Evaluation Platform
☆46Updated 5 months ago
Alternatives and similar repositories for zeno-hub
Users that are interested in zeno-hub are comparing it to the libraries listed below
Sorting:
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Sphynx Hallucination Induction☆53Updated 9 months ago
- Small, simple agent task environments for training and evaluation☆18Updated last year
- Chat Markup Language conversation library☆55Updated last year
- An attribution library for LLMs☆43Updated last year
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆63Updated 10 months ago
- ☆80Updated 2 weeks ago
- A framework for evaluating function calls made by LLMs☆40Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆42Updated last year
- ☆44Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- ☆50Updated 8 months ago
- ☆46Updated 2 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated last year
- Verbosity control for AI agents☆65Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆125Updated 2 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆88Updated last month
- ☆31Updated 11 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆105Updated last month
- Explore the use of DSPy for extracting features from PDFs 🔎☆48Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆59Updated 5 months ago
- Tools to make language models a bit easier to use☆54Updated last month
- Synthetic Data for LLM Fine-Tuning☆119Updated last year
- Open Implementations of LLM Analyses☆107Updated last year
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing☆52Updated last year