eujhwang / personalized-llms

personalized-llms with Allen Institute

☆15

Alternatives and similar repositories for personalized-llms

Users who are interested in personalized-llms are comparing it to the repositories listed below

Nanami18 / Snowballed_Hallucination
☆44 Updated 10 months ago
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆23 Updated 11 months ago
yikee / Knowledge_Conflict
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆17 Updated last month
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆27 Updated 9 months ago
technion-cs-nlp / hallucination-mitigation
☆22 Updated 7 months ago
jwallat / knowledge-probing
Code for our BlackboxNLP'20 paper "BERTnesia: Investigating the capture and forgetting of knowledge in BERT"
☆9 Updated 3 years ago
ruiyiw / patient-psi
PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)
☆74 Updated 8 months ago
scandukuri / assistant-gate
☆26 Updated last year
AI21Labs / factor
Code and data for the FACTOR paper
☆48 Updated last year
McGill-NLP / instruct-qa
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆86 Updated 11 months ago
OSU-NLP-Group / AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56 Updated 2 years ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆94 Updated last year
princeton-nlp / LM-Science-Tutor
☆43 Updated 11 months ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆40 Updated 2 years ago
skywalker023 / fantom
👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"
☆55 Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42 Updated last year
castorini / perm-sc
Official codebase for permutation self-consistency.
☆18 Updated last year
abhika-m / FAVA
☆72 Updated last year
SALT-NLP / CultureBank
☆43 Updated last year
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36 Updated last year
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆24 Updated 4 months ago
msclar / symbolictom
☆21 Updated last year
behavioral-data / Cognitive-Reframing
Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts
☆63 Updated last year
psunlpgroup / ReaLMistake
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆30 Updated 10 months ago
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆28 Updated last year
neulab / data-agora
[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆33 Updated 7 months ago
ritaranx / BMRetriever
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆21 Updated 9 months ago
hkust-nlp / felm
GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59 Updated last year
Scarelette / CultureLLM
☆31 Updated 8 months ago