ibm-self-serve-assets / JudgeIt-LLM-as-a-JudgeView external linksLinks
Automation Framework using LLM-as-a-judge to evaluate of Agentic AI, RAG, Text2SQL at scale; that is a good proxy for human judgement.
☆34Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for JudgeIt-LLM-as-a-Judge
Users that are interested in JudgeIt-LLM-as-a-Judge are comparing it to the libraries listed below
Sorting:
- Agentic RAG for open domain text-to-query☆16Aug 28, 2025Updated 5 months ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆28Dec 18, 2024Updated last year
- Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"☆28Sep 25, 2023Updated 2 years ago
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated 2 weeks ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 3 months ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Memory Topology for GPUs☆17Updated this week
- ext_mpi_collectives☆11Apr 1, 2025Updated 10 months ago
- ☆39Feb 9, 2026Updated last week
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆25Feb 4, 2026Updated last week
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 5 months ago
- ☆11Jun 16, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- ☆14May 1, 2023Updated 2 years ago
- ☆13Jan 16, 2025Updated last year
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- Performance Counter Reader☆11Sep 14, 2022Updated 3 years ago
- ☆11Feb 27, 2024Updated last year
- 2020湖南省第一届人工智能大赛参赛作品☆11Feb 17, 2022Updated 4 years ago
- How to build an ACP compliant agent that uses MCP as well!☆11May 6, 2025Updated 9 months ago
- GPU based 2D elastic FWI☆11Mar 6, 2018Updated 7 years ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- A comprehensive ELT pipeline for analyzing passenger satisfaction data. Features a modern data architecture with Apache Airflow for extra…☆12Oct 5, 2025Updated 4 months ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆14Dec 24, 2025Updated last month
- yolo目标检测算法☆15Jul 27, 2025Updated 6 months ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- pdf to markdown with Python3☆11Oct 30, 2019Updated 6 years ago
- 2D time-domain isotropic (visco)elastic FD modeling and full waveform inversion (FWI) code for SH-waves☆13Aug 9, 2020Updated 5 years ago
- Sequential Parameter Optimization in Python☆14Jan 12, 2026Updated last month
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated last year
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated 9 months ago
- Continuum Dynamics Evaluation and Test Suite☆15Aug 29, 2017Updated 8 years ago
- Build tools for Open-CE☆13Nov 13, 2025Updated 3 months ago
- An example project demonstrating one approach to GraphRAG☆14Sep 13, 2024Updated last year
- Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures☆11Apr 23, 2019Updated 6 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs).☆13Oct 14, 2024Updated last year
- ☆10Mar 6, 2023Updated 2 years ago