deshwalmahesh / PHUDGEView external linksLinks
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.
☆51Jul 10, 2024Updated last year
Alternatives and similar repositories for PHUDGE
Users that are interested in PHUDGE are comparing it to the libraries listed below
Sorting:
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols☆13Aug 13, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023☆15Feb 13, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- A bot that scrapes your jobs in real time, sort them according to preferences and runs an alert☆16Nov 14, 2024Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- Hugging Face Jobs☆19Jul 11, 2025Updated 7 months ago
- ☆42Apr 23, 2024Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Mar 4, 2025Updated 11 months ago
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Mar 27, 2024Updated last year
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated this week
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆42Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆1,043Apr 25, 2025Updated 9 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- Simple Streamlit UI for Ollama☆21May 13, 2024Updated last year
- 🤝 Trade any tensors over the network☆31Sep 27, 2023Updated 2 years ago
- Llama cute voice assistant☆27Sep 10, 2023Updated 2 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- ☆30Oct 4, 2024Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,555Jan 14, 2026Updated last month
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆32May 1, 2025Updated 9 months ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources☆17Nov 4, 2025Updated 3 months ago
- ☆29Apr 23, 2025Updated 9 months ago
- benchmarks for LLM tokenizers☆16Jan 15, 2026Updated 3 weeks ago
- Automatically evaluate your LLMs in Google Colab☆685May 7, 2024Updated last year
- A strongly typed Python DSL for developing message passing multi agent systems☆53Apr 9, 2024Updated last year
- A python package for benchmarking interpretability techniques on Transformers.☆215Sep 29, 2024Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- A frontend interface for interacting with AI Models. Compatible with Ollama and OpenAI☆10May 1, 2025Updated 9 months ago
- A Model Context Protocol server that provides documentation access capabilities. This server enables LLMs to search and retrieve content …☆18Apr 29, 2025Updated 9 months ago
- ☆11Mar 11, 2024Updated last year
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Python library for the enigma machine☆16Mar 7, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Mar 30, 2025Updated 10 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated last year