eujhwang / personalized-llms
personalized-llms with Allen Institute
☆15 · Updated 2 years ago
Alternatives and similar repositories for personalized-llms
Users who are interested in personalized-llms are comparing it to the libraries listed below.
- ☆44 · Updated 10 months ago
- This repository contains data, code and models for contextual noncompliance. ☆23 · Updated 11 months ago
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024 ☆17 · Updated last month
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 · Updated last year
- AbstainQA, ACL 2024 ☆27 · Updated 9 months ago
- ☆22 · Updated 7 months ago
- Code for our BlackboxNLP'20 paper "BERTnesia: Investigating the capture and forgetting of knowledge in BERT" ☆9 · Updated 3 years ago
- PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024) ☆74 · Updated 8 months ago
- ☆26 · Updated last year
- Code and data for the FACTOR paper ☆48 · Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 · Updated 11 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" ☆56 · Updated 2 years ago
- Token-level Reference-free Hallucination Detection ☆94 · Updated last year
- ☆43 · Updated 11 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆40 · Updated 2 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ☆55 · Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 · Updated last year
- Official codebase for permutation self-consistency. ☆18 · Updated last year
- ☆72 · Updated last year
- ☆43 · Updated last year
- Evaluate the Quality of Critique ☆36 · Updated last year
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆24 · Updated 4 months ago
- ☆21 · Updated last year
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts ☆63 · Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses". ☆30 · Updated 10 months ago
- Restore safety in fine-tuned language models through task arithmetic ☆28 · Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆33 · Updated 7 months ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers". ☆21 · Updated 9 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆59 · Updated last year
- ☆31 · Updated 8 months ago