technion-cs-nlp / llm-arithmetic-heuristics
☆15 Updated 3 weeks ago
Alternatives and similar repositories for llm-arithmetic-heuristics:
Users who are interested in llm-arithmetic-heuristics are comparing it to the libraries listed below.
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆65 Updated 8 months ago
- [… Findings … & … Oral] Enhancing Mathematical Reasonin… ☆48 Updated 9 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆42 Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆34 Updated 2 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning ☆47 Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆54 Updated 6 months ago
- ☆59 Updated 10 months ago
- ☆20 Updated 9 months ago
- ☆45 Updated 6 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆58 Updated 9 months ago
- ☆20 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆32 Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries ☆23 Updated 8 months ago
- A library for efficient patching and automatic circuit discovery. ☆54 Updated 2 weeks ago
- Codebase for Instruction Following without Instruction Tuning ☆33 Updated 5 months ago
- ☆40 Updated 2 weeks ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆57 Updated last month
- ☆28 Updated last month
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆49 Updated 3 months ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv:2401.01335) ☆29 Updated last year
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity… ☆23 Updated 11 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆68 Updated 11 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆78 Updated 6 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆101 Updated 11 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆52 Updated 11 months ago
- ☆17 Updated 4 months ago
- Evaluate the Quality of Critique ☆35 Updated 9 months ago