Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.
☆51Jul 10, 2024Updated last year
Alternatives and similar repositories for PHUDGE
Users that are interested in PHUDGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.