Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without a custom rubric or reference answer, with absolute or relative grading, and much more. It contains a list of the available tools, methods, repos, and code for hallucination detection, LLM evaluation, grading, and much more.
☆52 · Jul 10, 2024 · Updated last year
Alternatives and similar repositories for PHUDGE
Users that are interested in PHUDGE are comparing it to the libraries listed below.
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆12 · Jun 25, 2024 · Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 · Jun 7, 2024 · Updated last year
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- A bot that scrapes your jobs in real time, sort them according to preferences and runs an alert ☆16 · Nov 14, 2024 · Updated last year
- Hugging Face Jobs ☆19 · Jul 11, 2025 · Updated 8 months ago
- Chat client for LLMs. ☆15 · Jul 23, 2024 · Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols ☆13 · Aug 13, 2024 · Updated last year
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling ☆14 · Sep 27, 2025 · Updated 6 months ago
- ☆42 · Apr 23, 2024 · Updated last year
- A workbench application to test out different prompts on a variety of AI models to see how they perform ☆16 · Feb 9, 2025 · Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆294 · Feb 12, 2026 · Updated last month
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 · Jun 3, 2024 · Updated last year
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023 ☆15 · Feb 13, 2024 · Updated 2 years ago
- ☆11 · Sep 27, 2024 · Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 · Jan 29, 2024 · Updated 2 years ago
- 🤝 Trade any tensors over the network ☆31 · Sep 27, 2023 · Updated 2 years ago
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆1,060 · Apr 25, 2025 · Updated 11 months ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023) ☆11 · Feb 15, 2024 · Updated 2 years ago
- Iterate fast on your RAG pipelines ☆24 · Jun 21, 2025 · Updated 9 months ago
- Dataiku DSS plugin template with continuous integration. Test your plugins, release them faster ⚡️ ☆11 · Sep 23, 2025 · Updated 6 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr… ☆35 · Aug 21, 2025 · Updated 7 months ago
- ☆15 · Jul 15, 2024 · Updated last year
- ☆16 · Jul 23, 2024 · Updated last year
- ☆11 · Jul 16, 2024 · Updated last year
- Repository for the paper "MALADE: Orchestration of LLM-powered Agents with Retrieval Augmented Generation for Pharmacovigilance" ☆23 · Feb 19, 2025 · Updated last year
- ☆17 · Jun 8, 2025 · Updated 9 months ago
- A minimal viable example for a FastAPI, Celery, and Redis backend. With frontend JavaScript which polls tasks until done. ☆12 · Dec 3, 2020 · Updated 5 years ago
- ☆25 · Feb 23, 2026 · Updated last month
- 🟣 Feature Engineering interview questions and answers to help you prepare for your next machine learning and data science interview in 2… ☆16 · Jan 4, 2026 · Updated 2 months ago
- A list of resources to learn about data stewardship ☆15 · Dec 2, 2025 · Updated 3 months ago
- lancedb-myntra-fashion-search ☆34 · Apr 7, 2024 · Updated last year
- ☆38 · Jan 9, 2026 · Updated 2 months ago
- A python package for benchmarking interpretability techniques on Transformers. ☆215 · Sep 29, 2024 · Updated last year
- ☆16 · May 8, 2021 · Updated 4 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆39 · Jan 12, 2024 · Updated 2 years ago
- FlexiTokens ☆18 · Dec 27, 2025 · Updated 2 months ago
- This project is to list the best books, courses, tutorial, methods on learning certain knowledge ☆11 · Mar 22, 2021 · Updated 5 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago