UW-Madison-Lee-Lab / LLM-judge-reportingView external linksLinks
A simple plug-in framework that corrects bias and computes confidence intervals in reporting LLM-as-a-judge evaluation, and an adaptive algorithm that efficiently allocates calibration samples to reduce uncertainty in estimates.
☆69Nov 27, 2025Updated 2 months ago
Alternatives and similar repositories for LLM-judge-reporting
Users that are interested in LLM-judge-reporting are comparing it to the libraries listed below
Sorting:
- Python powered music controlling webpage with websockets and bottle py (works with spotify, vlc, audacious, and others)☆11Jun 9, 2017Updated 8 years ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 2 months ago
- ☆12Aug 6, 2024Updated last year
- An embedded Rust IDE with an emphasis on a fun and insightful coding experience☆11Sep 23, 2024Updated last year
- A vanilla implementation of ReAct: Synergizing Reasoning and Acting in Language Models☆15Mar 26, 2025Updated 10 months ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 6 months ago
- Fast, gpu-accelerated distance transforms☆13Mar 7, 2025Updated 11 months ago
- Document Drivien Development☆18Nov 9, 2025Updated 3 months ago
- HelTomo - Helsinki Tomography Toolbox☆11Aug 5, 2022Updated 3 years ago
- ☆14Dec 12, 2024Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025☆33Nov 15, 2025Updated 3 months ago
- ☆20Jul 23, 2025Updated 6 months ago
- ☆14Mar 8, 2025Updated 11 months ago
- CUDA implementation of Meijster's parallel algorithm for calculating the distance transform of a 2D image☆12Mar 18, 2019Updated 6 years ago
- Evolutionary Merge Experiment☆47Jun 10, 2024Updated last year
- Copyright-free Artificial Lyrics Dataset (ISMIR 2024 LBD)☆12Sep 1, 2024Updated last year
- Bayesian Optimization with multiple tiers of objectives, which can flexibly depend on model inputs and outputs☆11Sep 10, 2025Updated 5 months ago
- ☆13Aug 5, 2025Updated 6 months ago
- A lightweight, type-safe workflow engine for TypeScript that helps you create flexible, graph-based execution flows☆25Jun 24, 2025Updated 7 months ago
- Complex Reasoning with ReAct and LangChain☆12Apr 24, 2024Updated last year
- Unofficial client for Actual Budget - supporting Android and JVM desktop☆20Updated this week
- Code for the paper "Exploiting Pretrained Biochemical Language Models for Targeted Drug Design", to appear in Bioinformatics, Proceedings…☆17Feb 26, 2024Updated last year
- No more 8 hour sessions trying to get cuda to compile☆11Feb 16, 2025Updated 11 months ago
- A Deno-based CLI tool to recursively find and display TODOs in your project☆18Jun 19, 2025Updated 7 months ago
- ☆14Nov 27, 2023Updated 2 years ago
- ASTRA implementation of the simultaneous algebraic reconstruction technique (SART) with superiorization☆15Jun 28, 2024Updated last year
- ☆16May 31, 2024Updated last year
- a decentralized dataset generator and manipulator.☆13Updated this week
- VTK for rust, proof of concept.☆12May 14, 2016Updated 9 years ago
- A collection of fast FISTA-type algorithms for MRI.☆13Dec 18, 2023Updated 2 years ago
- Touch camera for Bevy that supports drag and pinch to zoom☆13Feb 21, 2024Updated last year
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- ☆13Oct 31, 2024Updated last year
- Official examples for quickshell☆31Jul 28, 2025Updated 6 months ago
- Simplified class for Zoltraak, a digital content production framework like program codes, images, speeches, presentations, books and vide…☆16Sep 25, 2024Updated last year
- A Rust library for solving sparse linear systems using direct methods.☆20Oct 13, 2025Updated 4 months ago
- ☆39Sep 7, 2025Updated 5 months ago
- CrysText: A Generative AI Approach for Text-Conditioned Crystal Structure Generation using LLM☆14Nov 3, 2025Updated 3 months ago