devichand579 / HPTLinks
code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles
☆23Updated 4 months ago
Alternatives and similar repositories for HPT
Users that are interested in HPT are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- ☆40Updated 11 months ago
- ☆55Updated last year
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆35Updated 4 months ago
- ☆51Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated 11 months ago
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 4 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago
- ☆67Updated 8 months ago
- ☆102Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- ☆62Updated 5 months ago
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆113Updated 7 months ago
- ☆33Updated 3 weeks ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 7 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆69Updated 5 months ago
- ☆32Updated last year
- ☆20Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated last year
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆56Updated this week
- ☆24Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated 3 weeks ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 11 months ago
- Official Repo for The Paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems"☆58Updated 9 months ago
- ☆28Updated 8 months ago
- ☆61Updated 11 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆23Updated last year