dhgottesman/keen_estimating_knowledge_in_llms

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dhgottesman/keen_estimating_knowledge_in_llms)

dhgottesman / keen_estimating_knowledge_in_llms

☆18

Alternatives and similar repositories for keen_estimating_knowledge_in_llms

Users that are interested in keen_estimating_knowledge_in_llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apartresearch / specificityplus
View on GitHub
👩‍💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"
☆20Jan 19, 2024Updated 2 years ago
amitlevy / evolutionaryGPT
View on GitHub
Evolutionary Search for expert-level performance on any task with environmental feedback
☆14Oct 12, 2025Updated 9 months ago
multimodal-ai-lab / DEFAME
View on GitHub
Fact-checking system for textual and visual inputs.
☆59May 21, 2026Updated last month
amazon-science / contrastive-controlled-mt
View on GitHub
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆22May 24, 2023Updated 3 years ago
nju-websoft / CCA
View on GitHub
Knowledge Graph Error Detection with Contrastive Confidence Adaption, AAAI 2024
☆19May 3, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
edenbiran / HoppingTooLate
View on GitHub
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆33Mar 2, 2025Updated last year
LeslieOverfitting / selective_distillation
View on GitHub
☆38Jun 3, 2021Updated 5 years ago
apple / ml-tic-lm
View on GitHub
Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025
☆24Apr 19, 2026Updated 2 months ago
RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
li-xirong / flickr8kcn
View on GitHub
A bilingual dataset for image captioning
☆19Oct 28, 2020Updated 5 years ago
open-compass / CIBench
View on GitHub
Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "
☆15Jul 19, 2024Updated last year
strongio / dosing-rl-gym
View on GitHub
Patient data simulator following the structure of an open-ai gym.
☆12Jul 9, 2019Updated 7 years ago
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
MichaelEinhorn / Composer
View on GitHub
Generates video game music using neural networks.
☆10Jun 9, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
xiye17 / TextualExplInContext
View on GitHub
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆16Feb 11, 2023Updated 3 years ago
benelot / Composer
View on GitHub
Generates video game music using neural networks.
☆12Jun 9, 2022Updated 4 years ago
XiaojuanTang / ICSR
View on GitHub
implementation of paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners"
☆20Aug 17, 2023Updated 2 years ago
shachardon / naturally_occurring_feedback
View on GitHub
☆14Dec 1, 2025Updated 7 months ago
McGill-NLP / instruct-qa
View on GitHub
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆87Aug 12, 2024Updated last year
HZ757 / reddit-liberation-extension
View on GitHub
A Chrome Extension for reddit users to minimize lost productivity on reddit
☆14Apr 4, 2025Updated last year
kukrishna / genaudit
View on GitHub
☆15Mar 29, 2025Updated last year
openfactcheck-research / openfactcheck
View on GitHub
An Open-source Factuality Evaluation Demo for LLMs
☆30Updated this week
khuangaf / ZeroFEC
View on GitHub
Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"
☆17Aug 14, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YihongDong / CDD-TED4LLMs
View on GitHub
☆16Nov 26, 2024Updated last year
ordavid-s / snmf-mlp-decomposition
View on GitHub
☆15Jul 7, 2026Updated last week
zjulgc / llmpeft4apr
View on GitHub
☆16Nov 9, 2024Updated last year
yoichi1484 / subspace
View on GitHub
An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)
☆10May 31, 2024Updated 2 years ago
multi-swe-bench / MagentLess
View on GitHub
☆13Jul 31, 2025Updated 11 months ago
amitlevy / finetune_llama_3_own_data
View on GitHub
Simple notebook to train (technically, fine-tune) llama 3 8B on your own text data!
☆24May 5, 2024Updated 2 years ago
abietti / transformer-birth
View on GitHub
☆19Dec 12, 2023Updated 2 years ago
philschmid / multilingual-serverless-qa-aws-lambda
View on GitHub
☆10Dec 17, 2020Updated 5 years ago
nicoladainese96 / code-world-models
View on GitHub
Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.
☆20Feb 21, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
apalle1 / Sentiment-Span-Extraction-Using-Transformer-Models
View on GitHub
PyTorch - Albert Large V2, Bert Base Uncased, Bert Large Uncased WWM Finetuned Squad, Distil Roberta Base, Roberta Base Squad2, Roberta l…
☆11Jul 10, 2020Updated 6 years ago
RossiXu / event-centric-opinion-mining
View on GitHub
☆12Nov 20, 2023Updated 2 years ago
JunsolKim / RepresentationPoliticalLLM
View on GitHub
Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.
☆25Mar 27, 2025Updated last year
UCSB-AI / Mitigate-Gender-Bias-in-Image-Search
View on GitHub
Code for the EMNLP 2021 Oral paper "Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search" https://arx…
☆12Feb 6, 2023Updated 3 years ago
Shujun-He / 3rd_Solution_Feedback_Prize_Evaluating_Student_Writing
View on GitHub
☆13Apr 16, 2022Updated 4 years ago
Liadrinz / transformers-copy-mechanism
View on GitHub
Overwrite huggingface BART and GPT with copy mechanism
☆21May 3, 2023Updated 3 years ago
nistvan86 / continuedev-llamacpp-gpu-llm-server
View on GitHub
☆10Nov 22, 2023Updated 2 years ago