devichand579 / HPT
Code for Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models
☆21 Updated 3 weeks ago
Alternatives and similar repositories for HPT
Users who are interested in HPT are comparing it to the repositories listed below.
- ☆50 Updated this week
- ☆41 Updated 5 months ago
- ☆21 Updated 3 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 Updated last year
- ☆24 Updated 8 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System. ☆18 Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 4 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 5 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆26 Updated 5 months ago
- ☆49 Updated 6 months ago
- ☆20 Updated last month
- ☆65 Updated 2 months ago
- ☆24 Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆24 Updated 2 months ago
- Testing paligemma2 finetuning on reasoning dataset ☆18 Updated 5 months ago
- ☆46 Updated 8 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆19 Updated last week
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation ☆31 Updated 3 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated last month
- ☆17 Updated 5 months ago
- Verifiers for LLM Reinforcement Learning ☆56 Updated last month
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior… ☆12 Updated last year
- Code repo for MathAgent ☆16 Updated last year
- ☆16 Updated 3 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆67 Updated 2 months ago
- ☆13 Updated 5 months ago