☆17Dec 21, 2023Updated 2 years ago
Alternatives and similar repositories for lm-truthfulness
Users that are interested in lm-truthfulness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆106Aug 8, 2024Updated last year
- ☆22Dec 11, 2024Updated last year
- A text-based game where language models learn to lie and to detect lies.☆12Oct 4, 2023Updated 2 years ago
- ☆16Sep 27, 2023Updated 2 years ago
- ☆50Jan 7, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆130Feb 10, 2026Updated 2 months ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- ☆21Aug 19, 2024Updated last year
- This repo contains information about FeB4RAG collection☆17Feb 19, 2024Updated 2 years ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆18Oct 7, 2025Updated 6 months ago
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆64Dec 25, 2023Updated 2 years ago
- ☆41Feb 11, 2025Updated last year
- ☆14May 12, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code to generate visual metamers via foveated feed-forward style transfer (ICLR 2019)☆19Apr 13, 2021Updated 5 years ago
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"☆14Oct 10, 2022Updated 3 years ago
- official repository for the Instance Prototype Contrastive Learning (IPCL)☆18Jun 20, 2022Updated 3 years ago
- The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"☆16Jul 29, 2021Updated 4 years ago
- ☆14Oct 28, 2023Updated 2 years ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- Benchmark for Hetergeneous Federated Learning by MARS Group at the Wuhan University, led by Prof. Mang Ye.☆19May 29, 2023Updated 2 years ago
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- (Pytorch ver) Code for "Fully Neural Network based Model for General Temporal Point Process"☆21Sep 15, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Jun 28, 2023Updated 2 years ago
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆130Mar 22, 2024Updated 2 years ago
- ☆43Sep 3, 2024Updated last year
- Code for NeurIPS 2019 paper ``Self-Critical Reasoning for Robust Visual Question Answering''☆41Sep 9, 2019Updated 6 years ago
- Comparison of gradient estimation techniques for black-box adversarial examples☆11Oct 31, 2018Updated 7 years ago
- ☆29Nov 9, 2025Updated 5 months ago
- ☆100Updated this week
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆55May 12, 2025Updated 11 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆10Jul 15, 2024Updated last year
- A tqdm multi-thread helper☆11Aug 12, 2019Updated 6 years ago
- ☆68Jun 27, 2022Updated 3 years ago
- ☆10Jul 28, 2022Updated 3 years ago
- Model in the loop approach for fig lang generation and explainibilty Code and Data for EMNLP 2022 paper FLUTE: Figurative Language Unders…☆13Apr 22, 2023Updated 3 years ago