EvalAssist is an open-source project that simplifies using large language models as evaluators (LLM-as-a-Judge) of the output of other large language models by supporting users in iteratively refining evaluation criteria in a web-based user experience.
☆97Apr 8, 2026Updated last week
Alternatives and similar repositories for eval-assist
Users that are interested in eval-assist are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Granite Guardian models are designed to detect risks in prompts and responses.☆136Mar 11, 2026Updated last month
- In-Context Explainability 360 toolkit☆66Mar 9, 2026Updated last month
- basic Open WebUI + Ollama stack for Local ChatGPT☆37Jan 28, 2026Updated 2 months ago
- NeurIPS'24 - LLM Safety Landscape☆39Oct 21, 2025Updated 5 months ago
- Synthetic Text Dataset Generation for LLM projects☆58Apr 5, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆23Jun 5, 2025Updated 10 months ago
- Centralize and streamline ML/AI lifecycle observability and compliance processes.☆12Feb 12, 2025Updated last year
- Code for "Out-of-Distribution Detection using Synthetic Data Generation"☆21Feb 6, 2025Updated last year
- Efficient and scalable zero-shot entity linking☆113Apr 6, 2026Updated last week
- Open source framework for evaluating AI Agents☆29Feb 24, 2026Updated last month
- Neural Unification for Logic Reasoning over Language☆22Nov 15, 2021Updated 4 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆65Feb 6, 2025Updated last year
- Zero and Few shot named entity & relationships recognition☆402Sep 17, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A demonstration of hybrid search with reranking using Qdrant and BGE-M3 model. A showcase of dense and sparse retrieval combined with Col…☆31Apr 4, 2025Updated last year
- Examples using the Deep Search functionalities☆86Jan 29, 2025Updated last year
- ☆10Dec 3, 2024Updated last year
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training☆10Sep 30, 2019Updated 6 years ago
- 360M model running in the browser on WebGPU☆23Aug 20, 2024Updated last year
- ☆14Apr 8, 2026Updated last week
- Quality of service (QoS) dashboard for Web Infra OSS projects.☆17Updated this week
- ☆10Oct 28, 2020Updated 5 years ago
- ☆272Jun 25, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Aug 3, 2022Updated 3 years ago
- ☆12Jul 6, 2022Updated 3 years ago
- Generate HTML forms from Pydantic models for your FastHTML application☆44Apr 2, 2026Updated 2 weeks ago
- ☆14Dec 1, 2025Updated 4 months ago
- Repo for the Advanced Python Skills course that I created (hosted in Udemy and Skillshare)☆15Nov 1, 2020Updated 5 years ago
- AMR Parsing via Graph-Sequence Iterative Inference☆70Jun 12, 2023Updated 2 years ago
- Chu-Lui-Edmonds decoding extracted from TurboParser☆14May 16, 2017Updated 8 years ago
- A Python SDK for optimizing prompts for Amazon Nova Models.☆55Mar 16, 2026Updated last month
- Appium CDP Driver is a W3C WebDriver that allows you to connect to chromium based android mobile browsers like chrome & samsung browser t…☆10Jul 16, 2025Updated 9 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official Python library for Openlayer, the Continuous Model Improvement Platform for AI. 📈☆16Updated this week
- Contains code used to conduct experiments on dependency parsing with the Tensor-LSTM model developed for our paper "Cross-Lingual Depende…☆13Jan 5, 2017Updated 9 years ago
- Neural Network Based Dependency Parsers☆11Jan 14, 2016Updated 10 years ago
- Parsing only with Pretraining Networks☆16Jul 25, 2024Updated last year
- Hal Daume's hbc☆20Jan 23, 2010Updated 16 years ago
- Python framework which enables you to transform how a user calls or infers an IBM Granite model and how the output from the model is retu…☆57Apr 8, 2026Updated last week
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆16Updated this week