javyduck / KnowHalu
☆47 · Updated 10 months ago
Alternatives and similar repositories for KnowHalu:
Users who are interested in KnowHalu are comparing it to the repositories listed below.
- Mixing Language Models with Self-Verification and Meta-Verification ☆103 · Updated 4 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" ☆65 · Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 · Updated 9 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆54 · Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 · Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆66 · Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆77 · Updated 7 months ago
- The first dense retrieval model that can be prompted like an LM ☆70 · Updated 7 months ago
- ☆41 · Updated 4 months ago
- ☆74 · Updated 3 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments ☆51 · Updated last month
- ☆45 · Updated 6 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆77 · Updated 6 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆105 · Updated last week
- ☆46 · Updated last week
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 · Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆89 · Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆76 · Updated 6 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆106 · Updated 7 months ago
- LLM reads a paper and produces a working prototype ☆52 · Updated last week
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆64 · Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆104 · Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 · Updated 7 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 · Updated last year
- ☆81 · Updated last year
- ☆35 · Updated 9 months ago
- ☆48 · Updated 5 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆75 · Updated last month
- ☆50 · Updated 4 months ago