☆38Jan 17, 2025Updated last year
Alternatives and similar repositories for StanfordClashEval
Users that are interested in StanfordClashEval are comparing it to the libraries listed below
Sorting:
- ☆14Oct 17, 2024Updated last year
- Open sourced result for The Agent Company☆21Nov 11, 2025Updated 4 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 9 months ago
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆24Oct 22, 2024Updated last year
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Feb 4, 2026Updated last month
- Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper☆29Jun 28, 2024Updated last year
- ☆13Oct 5, 2025Updated 5 months ago
- ☆36May 21, 2025Updated 9 months ago
- ☆36Nov 15, 2023Updated 2 years ago
- Study and research with your docs, media, and AI in one place☆34Updated this week
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Apr 13, 2023Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆37Sep 5, 2024Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆153Sep 21, 2024Updated last year
- ☆79Nov 19, 2024Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper☆54Apr 4, 2025Updated 11 months ago
- ☆13Nov 5, 2024Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- ☆12Jul 8, 2024Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- A platform for storing large semantic networks on MongoDB☆22Jun 20, 2011Updated 14 years ago
- An simplest PE parser, which list all import and export entries☆12Oct 11, 2018Updated 7 years ago
- LaTeX template of graduate Thesis [University of Chinese Academy of Sciences]☆12Nov 7, 2017Updated 8 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- ☆10Oct 20, 2020Updated 5 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- Unofficial implementation of "Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle"☆13Jul 3, 2024Updated last year
- Modify ELF executables☆16Mar 5, 2019Updated 7 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Oct 29, 2024Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Check for know iframeBuster XSS☆12Sep 25, 2024Updated last year
- Predicting the Stock Market - Can we do it?☆10Jul 24, 2021Updated 4 years ago
- Agent installed on node to launch IDA,Bindiff,... and send results to the server ( AutoDiffWeb )☆10Mar 25, 2016Updated 9 years ago