devichand579 / HPTLinks

code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles

☆23

Alternatives and similar repositories for HPT

Users that are interested in HPT are comparing it to the libraries listed below

Sorting:

miralab-ai / autoreason
☆40Updated 7 months ago
agential-ai / agential
🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!
☆52Updated 2 weeks ago
zjunlp / OneEdit
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆19Updated 9 months ago
du-nlp-lab / MLR-Copilot
☆66Updated 3 months ago
Bessouat40 / RAGLight
RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…
☆30Updated 4 months ago
padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 9 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆45Updated last year
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆27Updated 7 months ago
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 7 months ago
dinobby / MAgICoRE
☆24Updated 10 months ago
PathOnAIOrg / LiteMultiAgent
The Library for LLM-based multi-agent applications
☆87Updated this week
Tebmer / Rereading-LLM-Reasoning
EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…
☆26Updated 7 months ago
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
LaVieEnRose365 / AutoGraph
☆18Updated 2 weeks ago
google-deepmind / llms_can_learn_rules
☆57Updated 7 months ago
TergelMunkhbat / concise-reasoning
Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models
☆39Updated 3 months ago
arcee-ai / DAM
☆53Updated 8 months ago
yueqis / API-Based-Agent
☆54Updated 3 weeks ago
DeepSoftwareAnalytics / Awesome-Agent4SE
☆96Updated 10 months ago
sony / talkhier
Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"
☆55Updated 5 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆65Updated 3 months ago
rhyang2021 / SELFGOAL
Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".
☆68Updated last year
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91Updated 6 months ago
govtech-responsibleai / KnowOrNot
☆19Updated 3 weeks ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35Updated last year
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆75Updated this week
sunblaze-ucb / AgentSynth
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆25Updated last month
google-deepmind / latent-multi-hop-reasoning
[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?
☆71Updated 4 months ago
THU-KEG / Agentic-Reward-Modeling
[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
☆97Updated last month
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 10 months ago