EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other large language models by supporting users in iteratively refining evaluation criteria in a web-based user experience.
☆95 · Nov 28, 2025 · Updated 3 months ago
Alternatives and similar repositories for eval-assist
Users that are interested in eval-assist are comparing it to the libraries listed below.
- Long-form factuality assessor for large language models ☆29 · Mar 20, 2026 · Updated last week
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆136 · Mar 11, 2026 · Updated 2 weeks ago
- The repo consists of a Python package that works with functional data. In particular, it includes two distinct methodologies: Functional … ☆13 · Sep 18, 2025 · Updated 6 months ago
- A framework for agentic tool use training with reinforcement learning ☆163 · Jan 5, 2026 · Updated 2 months ago
- Ontology representing a 360-view of a person (or cohort) that spans across multiple domains, from health to social. ☆35 · Sep 17, 2025 · Updated 6 months ago
- Code to enable layer-level steering in LLMs using sparse auto encoders ☆31 · Sep 18, 2025 · Updated 6 months ago
- Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation re… ☆43 · Updated this week
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projections ☆13 · Nov 1, 2024 · Updated last year
- Synthetic Text Dataset Generation for LLM projects ☆56 · Mar 10, 2026 · Updated 2 weeks ago
- ☆23 · Jun 5, 2025 · Updated 9 months ago
- Centralize and streamline ML/AI lifecycle observability and compliance processes. ☆12 · Feb 12, 2025 · Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆222 · Jan 24, 2025 · Updated last year
- Independent evaluation set construction for trustworthy ML models in biochemistry ☆16 · Mar 20, 2026 · Updated last week
- Open source no-code system for text annotation and building of text classifiers ☆271 · May 26, 2025 · Updated 10 months ago
- 🚀 Guardrails orchestration server for application of various detections on text generation input and output. ☆30 · Updated this week
- Prompt Declaration Language (PDL) is a declarative prompt programming language. ☆288 · Updated this week
- Efficient and scalable zero-shot entity linking ☆110 · Mar 16, 2026 · Updated last week
- This repo contains the dataset for the EMNLP 2022 paper "Why Do You Feel This Way? Summarizing Triggers of Emotions in Social Media Posts… ☆19 · Oct 9, 2023 · Updated 2 years ago
- QT-DOG: QUANTIZATION-AWARE TRAINING FOR DOMAIN GENERALIZATION ☆24 · Nov 30, 2025 · Updated 3 months ago
- Zero and Few shot named entity & relationships recognition ☆402 · Sep 17, 2025 · Updated 6 months ago
- A web app for rapidly prototyping AI agents and the lightweight web UIs that wrap them—build flows, preview interactions, and share agent… ☆63 · Mar 19, 2026 · Updated last week
- Examples using the Deep Search functionalities ☆85 · Jan 29, 2025 · Updated last year
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training ☆10 · Sep 30, 2019 · Updated 6 years ago
- 360M model running in the browser on WebGPU ☆23 · Aug 20, 2024 · Updated last year
- ☆33 · Mar 9, 2026 · Updated 2 weeks ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data … ☆211 · Feb 16, 2026 · Updated last month
- ☆14 · Jun 25, 2025 · Updated 9 months ago
- ☆27 · Sep 1, 2024 · Updated last year
- ☆271 · Jun 25, 2025 · Updated 9 months ago
- graph2mat: Graph to matrix conversion ☆21 · Jan 19, 2026 · Updated 2 months ago
- ☆12 · Dec 8, 2022 · Updated 3 years ago
- [ICLR 2025] General-purpose activation steering library ☆153 · Sep 18, 2025 · Updated 6 months ago
- Docker container for UDPipe (https://github.com/ufal/udpipe) REST server. ☆12 · Jun 23, 2020 · Updated 5 years ago
- AMR Parsing via Graph-Sequence Iterative Inference ☆70 · Jun 12, 2023 · Updated 2 years ago
- eTaPR ☆16 · May 16, 2023 · Updated 2 years ago
- codebase release for EMNLP2023 paper publication ☆19 · Sep 18, 2025 · Updated 6 months ago
- Chu-Liu-Edmonds decoding extracted from TurboParser ☆14 · May 16, 2017 · Updated 8 years ago
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting ☆11 · Jan 9, 2023 · Updated 3 years ago
- ☆18 · Mar 20, 2022 · Updated 4 years ago