devichand579 / HPT
Code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles
☆23Updated last month
Alternatives and similar repositories for HPT
Users who are interested in HPT are comparing it to the repositories listed below
- ☆40Updated 9 months ago
- ☆11Updated 10 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 9 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆31Updated 2 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Updated 11 months ago
- ☆56Updated 2 months ago
- LLM reads a paper and produces a working prototype☆56Updated 5 months ago
- ☆54Updated 10 months ago
- Multi-Granularity LLM Debugger☆90Updated 2 months ago
- ☆67Updated 5 months ago
- The Library for LLM-based multi-agent applications☆90Updated 2 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 9 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆30Updated last month
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆15Updated 5 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆57Updated 6 months ago
- ☆99Updated last year
- Example implementation of Iteration of Thought - Give a star if you like the project☆43Updated 8 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 2 months ago
- Open Implementations of LLM Analyses☆107Updated 11 months ago
- ☆59Updated 9 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆58Updated 2 months ago
- ☆50Updated 11 months ago
- ☆23Updated last year
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆82Updated last week
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Easy-to-use, high-performance knowledge distillation for LLMs☆93Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 9 months ago
- ☆23Updated 7 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆28Updated 5 months ago