devichand579 / HPT
code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models
☆20Updated 2 months ago
Alternatives and similar repositories for HPT:
Users that are interested in HPT are comparing it to the libraries listed below
- ☆41Updated 4 months ago
- Knowledge Unlearning for Large Language Models☆25Updated this week
- ☆19Updated last month
- ☆28Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 4 months ago
- ☆63Updated last month
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆20Updated last month
- ☆45Updated 7 months ago
- ☆20Updated 2 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated 2 weeks ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated 2 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆18Updated 6 months ago
- ☆56Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆25Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆18Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with Azure OpenAI and 🌐 web browser control via Playwright.☆19Updated last month
- ☆13Updated 4 months ago
- Measuring RAG solutions throughput and latency☆17Updated 9 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 8 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆33Updated last month
- The Swarm Ecosystem☆20Updated 9 months ago
- ☆24Updated 3 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆51Updated last month
- ☆15Updated last month
- The original Shared Recurrent Memory Transformer implementation☆24Updated 3 months ago
- ☆9Updated last year