Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.
☆52Jul 10, 2024Updated last year
Alternatives and similar repositories for PHUDGE
Users that are interested in PHUDGE are comparing it to the libraries listed below
Sorting:
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆12Jun 25, 2024Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023☆15Feb 13, 2024Updated 2 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Chat client for LLMs.☆15Jul 23, 2024Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling☆14Sep 27, 2025Updated 5 months ago
- Realign is a testing and simulation framework for AI applications.☆18Dec 4, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- A bot that scrapes your jobs in real time, sort them according to preferences and runs an alert☆16Nov 14, 2024Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- ☆33Jul 9, 2025Updated 7 months ago
- ☆16Jul 23, 2024Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Feb 12, 2026Updated 3 weeks ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated last year
- Iterate fast on your RAG pipelines