zeno-ml / zeno-evals
Visualize OpenAI Evals with Zeno
☆24Updated last year
Alternatives and similar repositories for zeno-evals:
Users that are interested in zeno-evals are comparing it to the libraries listed below
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 6 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆46Updated 8 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 5 months ago
- Replace expensive LLM calls with finetunes automatically☆62Updated last year
- ☆60Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- ☆22Updated last year
- ☆24Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 11 months ago
- ☆51Updated 2 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆32Updated 9 months ago
- ☆57Updated last year
- Tools for formatting large language model prompts.☆12Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 11 months ago
- ☆37Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of …☆22Updated 4 months ago
- AI Evaluation Platform☆46Updated this week
- ☆31Updated last year
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles☆21Updated last month
- ☆26Updated 4 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 4 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year