yizhongw/truthfulqa_reeval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yizhongw/truthfulqa_reeval)

yizhongw / truthfulqa_reeval

☆12

Alternatives and similar repositories for truthfulqa_reeval

Users that are interested in truthfulqa_reeval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

THU-KEG / LRM-FactEval
View on GitHub
☆16Jun 25, 2025Updated 11 months ago
d223302 / Over-Reasoning-of-LLMs
View on GitHub
Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models
☆11Jan 23, 2024Updated 2 years ago
mt-upc / transformer-contributions-nmt
View on GitHub
☆18Oct 6, 2022Updated 3 years ago
NoamAndRoy / JargonProject
View on GitHub
☆13Mar 16, 2026Updated 2 months ago
UpstageAI / evalverse-IFEval
View on GitHub
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆14May 4, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
deep-spin / lmt_hallucinations
View on GitHub
☆18Jun 13, 2023Updated 2 years ago
charlestudor / PokerNowLogConverter
View on GitHub
A simple CLI tool for converting logs from Poker Now games to the PokerStars format.
☆18Sep 7, 2024Updated last year
janphilippfranken / sami
View on GitHub
Self-Supervised Alignment with Mutual Information
☆20May 24, 2024Updated 2 years ago
jongjyh / TrFr
View on GitHub
Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning
☆46Dec 19, 2023Updated 2 years ago
bm2-lab / CausCell
View on GitHub
☆23Jun 14, 2025Updated 11 months ago
zepingyu0512 / in-context-mechanism
View on GitHub
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13Nov 17, 2024Updated last year
MiuLab / FactAlign
View on GitHub
Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"
☆19Oct 3, 2024Updated last year
michigan-traffic-lab / osaas-public
View on GitHub
☆16Feb 20, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhumeiqiBUPT / GNN-LF-HF
View on GitHub
WWW2021: Interpreting and Unifying Graph Neural Networks with An Optimization Framework
☆14Jun 23, 2021Updated 4 years ago
seatgeek / tornado-async-transformer
View on GitHub
libcst transformer that replaces tornado's legacy @gen.coroutine syntax with python3.5+ native async/await
☆21Mar 20, 2024Updated 2 years ago
AI21Labs / factor
View on GitHub
Code and data for the FACTOR paper
☆53Nov 15, 2023Updated 2 years ago
jonasrauber / linear-region-attack
View on GitHub
A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb…
☆12Aug 5, 2020Updated 5 years ago
biasinrecsys / wsdm2021
View on GitHub
WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web
☆11Mar 8, 2021Updated 5 years ago
randomizedtree / segment-watermark
View on GitHub
☆19Sep 9, 2024Updated last year
JayZhang42 / SLED
View on GitHub
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆122Dec 5, 2024Updated last year
sail-sg / dice
View on GitHub
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47Apr 15, 2025Updated last year
bangawayoo / mb-lm-watermarking
View on GitHub
multi-bit language model watermarking (NAACL 24)
☆18Sep 20, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aiovine / converse-dataset
View on GitHub
Natural language dataset for training a Conversational Recommender System
☆11Jul 9, 2019Updated 6 years ago
deegy666 / ADD-RSC
View on GitHub
Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’
☆22Dec 19, 2025Updated 5 months ago
THU-BPM / Watermark-Radioactivity-Attack
View on GitHub
[ACL 2025 Main] Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?"
☆22Jun 18, 2025Updated 11 months ago
homles11 / SaLoRA
View on GitHub
Code for “SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation(ICLR 2025)”
☆27Oct 23, 2025Updated 7 months ago
bbc / dsrp_bbcavs10k_distribution
View on GitHub
Repo for the BBCAVS10k distribution
☆10Nov 27, 2024Updated last year
leeguandong / EcommerceLLM
View on GitHub
基于电商数据微调的Qwen1.5系列的电商大模型，包括0.5b-base，0.5b-chat，1.8b-base，7b-base，以及基于llama3-chinese-sft版本的基础模型的sft后电商大模型。
☆25May 14, 2024Updated 2 years ago
smartyfh / MultiWOZ2.4
View on GitHub
MultiWOZ 2.4: A Multi-Domain Task-Oriented Dialogue Dataset
☆71Nov 9, 2022Updated 3 years ago
zaixizhang / CBD
View on GitHub
Official Inplementation of CVPR23 paper "Backdoor Defense via Deconfounded Representation Learning"
☆25Mar 13, 2023Updated 3 years ago
VProv / uncertainty_example
View on GitHub
☆13Oct 12, 2020Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
PKU-TANGENT / ConFiguRe
View on GitHub
Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆13Jul 27, 2023Updated 2 years ago
biasinrecsys / umap2020
View on GitHub
ACM UMAP2020 Hands-on Tutorial on Data and Algorithmic Bias in Recommender Systems
☆10May 23, 2021Updated 5 years ago
MohammadHeydari / Persian_FastText
View on GitHub
Persian Word Embedding Using FastText Pre-trained Model
☆13Apr 16, 2021Updated 5 years ago
pkunlp-icler / MLS
View on GitHub
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13Apr 13, 2022Updated 4 years ago
vigilant-umbrella / wikiHowUnofficialAPI
View on GitHub
API to extract data from wikiHow
☆18Jul 10, 2021Updated 4 years ago
syncdoth / Chain-of-Hindsight-PyTorch
View on GitHub
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.
☆11Apr 5, 2023Updated 3 years ago
Arvid-pku / ALCUNA
View on GitHub
[EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge
☆30Oct 30, 2023Updated 2 years ago