zeno-ml / zeno-hub
AI Evaluation Platform
☆47 Updated 8 months ago
Alternatives and similar repositories for zeno-hub
Users who are interested in zeno-hub are comparing it to the libraries listed below.
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- Chat Markup Language conversation library ☆55 Updated 2 years ago
- An attribution library for LLMs ☆46 Updated last year
- Small, simple agent task environments for training and evaluation ☆19 Updated last year
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- ReLM is a Regular Expression engine for Language Models ☆107 Updated 2 years ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆69 Updated 2 months ago
- ☆53 Updated 11 months ago
- ☆91 Updated last month
- Leverage your LangChain trace data for fine tuning ☆46 Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 Updated 2 years ago
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆48 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆45 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- LLM finetuning ☆42 Updated 2 years ago
- Sphynx Hallucination Induction ☆52 Updated last year
- ☆55 Updated last year
- Open Implementations of LLM Analyses ☆107 Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆34 Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆114 Updated 9 months ago
- Automating enterprise workflows with multimodal agents ☆114 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆107 Updated 4 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆69 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆90 Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆126 Updated 2 years ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆55 Updated 5 months ago
- Python library to use Pleias-RAG models ☆68 Updated 8 months ago