ibm-self-serve-assets/JudgeIt-LLM-as-a-Judge

An automation framework that uses LLM-as-a-judge to evaluate Agentic AI, RAG, and Text2SQL at scale, serving as a good proxy for human judgement.

☆36

Alternatives and similar repositories for JudgeIt-LLM-as-a-Judge

Users who are interested in JudgeIt-LLM-as-a-Judge are comparing it to the libraries listed below.

ibm-self-serve-assets / MetaGen-Blended-RAG
☆17 · Aug 5, 2025 · Updated 11 months ago
yhyu / agentic-text2sql
Agentic RAG for open domain text-to-query
☆16 · Aug 28, 2025 · Updated 11 months ago
OpenBMB / RAG-DDR
This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".
☆23 · Oct 28, 2024 · Updated last year
gadm21 / Face-recognition-using-PCA-and-SVD
In this project, a facial recognition algorithm is implemented in Python using PCA and SVD dimensionality reduction tools.
☆11 · Sep 2, 2019 · Updated 6 years ago
Bayer-Group / text-to-sql-epi-ehr-naacl2024
Code for Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records
☆24 · May 15, 2024 · Updated 2 years ago
mettamind-ai / physics_of_llms
Experiments related to LLMs for Vietnamese (inspired by the Physics of LLMs series)
☆11 · Oct 21, 2024 · Updated last year
Cen-Jipeng-SUDA / SQLFixAgent
The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…
☆27 · May 2, 2025 · Updated last year
ProtonX-AI-for-Devs-01 / quang-le-vietnamese-rag
☆13 · Oct 6, 2024 · Updated last year
RUCKBReasoning / DPO_Text2SQL
[ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
☆16 · Oct 9, 2025 · Updated 9 months ago
Frank-qlu / GNN_latest_papers
The latest research papers on graph representation learning [a collection of the latest graph neural network papers, continuously updated; follows welcome]
☆10 · Jan 7, 2025 · Updated last year
jensenw1 / RETQA
A Large-Scale Open-Domain Tabular Question Answering Dataset for the Real Estate Sector
☆17 · Jun 26, 2025 · Updated last year
DAMO-NLP-SG / SSTuning
Code for ACL paper "Zero-Shot Text Classification via Self-Supervised Tuning"
☆29 · Sep 25, 2023 · Updated 2 years ago
MarkPhamm / skytrax_reviews
End-to-end ELT pipeline for 160K+ Skytrax airline reviews: Airflow orchestration, BeautifulSoup scraping, S3 staging, Snowflake wareho…
☆12 · Updated this week
iohub / OpenCopilot
Copilot with deepseek and more...
☆13 · Mar 7, 2025 · Updated last year
zillow / compliant-real-estate-chatbot
Code and experiments for the paper "A recipe for building a compliant real estate chatbot"
☆19 · Mar 21, 2025 · Updated last year
IBM / dsce-sample-apps
Demo apps using the IBM Building Blocks. Demos: https://dsce.ibm.com. Docs: https://ibm.github.io/dsce-sample-apps/
☆47 · Updated this week
X-LANCE / text2sql-GPT
[EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
☆23 · Jan 11, 2024 · Updated 2 years ago
hungson175 / shared_fin_bot
Long term stock (VN100
☆19 · Mar 19, 2025 · Updated last year
rikzze1 / garbage-classification
Python notebook about garbage detection based on convolutional neural network
☆17 · Sep 11, 2025 · Updated 10 months ago
saminkhan1 / real-estate-ai-agent
Full-stack LangGraph real-estate agent for lead intake, property search, Google Calendar scheduling, chat, SMS, and voice workflows
☆31 · May 21, 2026 · Updated 2 months ago
VimalWill / Vstream
Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)
☆10 · Feb 2, 2024 · Updated 2 years ago
zhangzhejian / quantgpt-agents
A Stock Price prediction system using LLM and Multi-agent-system
☆27 · Oct 24, 2023 · Updated 2 years ago
aaronabentheuer / Cultivator
Speculative design project about bioprinting of meat and how it could end up in the kitchen of tomorrow.
☆12 · May 25, 2015 · Updated 11 years ago
Eric-Chung-0511 / Learning-Record
Empowering Data Driven insights through hands-on projects, SQL challenges and practical tools.
☆24 · Jul 11, 2026 · Updated 2 weeks ago
casetext / r-and-r
Code for the "Long Context Needs Some R&R" paper.
☆12 · Mar 11, 2024 · Updated 2 years ago
mahyar-jahaninasab / Feature-Enforcing-PINN
Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs).
☆13 · Oct 14, 2024 · Updated last year
ai-wand / concise-reasoning
Concise Reasoning via Reinforcement Learning
☆13 · Apr 16, 2025 · Updated last year
aymenfurter / smartrag
Deep Research through Multi-Agents, using GraphRAG
☆87 · Aug 21, 2025 · Updated 11 months ago
davide-coccomini / Deepfake-Detection-Challenge-DFAD2023
Implementation of the winning solution for the Media Analytics Challenge 2023.
☆25 · Jan 31, 2024 · Updated 2 years ago
NoeSamaille / continue-watsonx
CustomLLM config to leverage watsonx LLMs with continue.dev.
☆17 · Aug 27, 2024 · Updated last year
aws-samples / llmops-workshop
☆24 · Dec 12, 2024 · Updated last year
wangjx22 / CLAPS
☆10 · Mar 6, 2023 · Updated 3 years ago
IBM / sql-rl-gen
The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by an LLM to guide the agent's …
☆25 · Sep 18, 2025 · Updated 10 months ago
init0xyz / AdaCQR
Implementation of AdaCQR (COLING 2025)
☆15 · Dec 30, 2024 · Updated last year
moshe / abache
HTTP server implemented in around 50 lines of bash
☆19 · Aug 8, 2015 · Updated 10 years ago
roee30 / miniloop
A minimal event loop implementation
☆13 · Nov 27, 2023 · Updated 2 years ago
quereste / deepfake-for-the-good
Official repository of paper "Deepfake for the Good: Generating Avatars through Face-Swapping with Implicit Deepfake Generation"
☆37 · Aug 4, 2025 · Updated 11 months ago
ashishtiwari1993 / langchain-elasticsearch-RAG
Example of Langchain-Elasticsearch integrations & RAG.
☆12 · Sep 20, 2024 · Updated last year
bigginlab / aescore
Learning Protein-Ligand Properties with Atomic Environment Vectors
☆10 · Apr 19, 2024 · Updated 2 years ago