haizelabs / Awesome-LLM-JudgesView external linksLinks
⚖️ Awesome LLM Judges ⚖️
☆174Apr 28, 2025Updated 9 months ago
Alternatives and similar repositories for Awesome-LLM-Judges
Users that are interested in Awesome-LLM-Judges are comparing it to the libraries listed below
Sorting:
- Inference-time scaling for LLMs-as-a-judge.☆329Nov 5, 2025Updated 3 months ago
- Mine-tuning is a methodology for synchronizing human and AI attention.☆19Jun 16, 2024Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- ☆28Apr 2, 2025Updated 10 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- The AI Adoption and Management Framework (AI-AMF) is a structured methodology designed to help organizations successfully integrate artif…☆14Feb 18, 2025Updated 11 months ago
- Repository for "Training Language Models To Explain Their Own Computations"☆20Dec 22, 2025Updated last month
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 3 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 6 months ago
- we got you bro☆37Jul 29, 2024Updated last year
- ☆17May 8, 2024Updated last year
- The SDK interface to Letta Code. Build deeply personalized agents with persistent memory that learn over time.☆43Feb 11, 2026Updated last week
- Natural Language is All a Graph Needs - LLM / Graph AI / Knowledge Graph - Experiments☆38Sep 27, 2023Updated 2 years ago
- This curated list focuses on tools and frameworks for building AI agents☆27Jan 31, 2026Updated 2 weeks ago
- Simple utility for embedding files/resources inside golang binaries☆20Feb 20, 2021Updated 4 years ago
- ☆67May 23, 2025Updated 8 months ago
- A curated list of AI agents, frameworks, and tools that automate tasks, enhance workflows, and push the boundaries of artificial intellig…☆22Mar 2, 2025Updated 11 months ago
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆19Mar 3, 2025Updated 11 months ago
- Personal project, Generative AI, Streamlit, Python☆54Apr 30, 2025Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 3 months ago
- PromptMII: Meta-Learning Instruction Induction for LLMs☆46Jan 12, 2026Updated last month
- Fluent student-teacher redteaming☆23Jul 25, 2024Updated last year
- ☆29Jul 6, 2023Updated 2 years ago
- A tool for AI agents to discover and learn skills autonomously☆183Feb 8, 2026Updated last week
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 7 months ago
- Codebase for Obfuscated Activations Bypass LLM Latent-Space Defenses☆28Feb 11, 2025Updated last year
- Autoregressive Image Generation☆31Jun 13, 2025Updated 8 months ago
- The semantic layer for software engineering: Connect code to meaning, build on understanding☆37Apr 17, 2025Updated 10 months ago
- ☆27Oct 22, 2024Updated last year
- Prompt design in Python☆65Nov 27, 2024Updated last year
- Letting Claude Code develop his own MCP tools :)☆123Mar 8, 2025Updated 11 months ago
- AuraMatrix is personality analysis web which using llm to do evaluation. I have made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated last month
- A reading list of relevant papers and projects on foundation model annotation☆28Feb 27, 2025Updated 11 months ago
- ☆37May 5, 2025Updated 9 months ago
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆758Jun 2, 2025Updated 8 months ago
- Go language binding for Kùzu graph database management system☆42Oct 10, 2025Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆90Mar 18, 2025Updated 10 months ago
- 🧠 Universal semantic indexer providing persistent memory for Claude Code through knowledge graphs, Tree-sitter parsing, and Qdrant vec…☆71Jul 31, 2025Updated 6 months ago