acl-org / acl-2025Links
☆12Updated 3 months ago
Alternatives and similar repositories for acl-2025
Users that are interested in acl-2025 are comparing it to the libraries listed below
Sorting:
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆162Updated 4 months ago
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs☆295Updated last year
- Codebase for reproducing the experiments of the semantic uncertainty paper (short-phrase and sentence-length experiments).☆385Updated last year
- BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).☆163Updated 2 years ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆163Updated 8 months ago
- Multilingual Large Language Models Evaluation Benchmark☆132Updated last year
- A Survey of Attributions for Large Language Models☆218Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆520Updated last year
- LLM hallucination paper list☆323Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆398Updated 6 months ago
- ☆155Updated last year
- LLM Unlearning☆177Updated 2 years ago
- Benchmarking LLMs' Emotional Alignment with Humans☆114Updated 9 months ago
- ☆85Updated 10 months ago
- [ICML 2024] TrustLLM: Trustworthiness in Large Language Models☆604Updated 4 months ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆233Updated 2 years ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156☆40Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models☆64Updated 8 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆50Updated 2 years ago
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey`☆190Updated 2 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆79Updated 11 months ago
- ☆155Updated 2 years ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆165Updated last year
- [NeurIPS 2024] How do Large Language Models Handle Multilingualism?☆44Updated last year
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆127Updated 3 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆235Updated 10 months ago
- Repository for the Bias Benchmark for QA dataset.☆129Updated last year
- Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"☆22Updated 5 months ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆35Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆60Updated last year