kevinwu23 / StanfordClashEvalView external linksLinks
☆38Jan 17, 2025Updated last year
Alternatives and similar repositories for StanfordClashEval
Users that are interested in StanfordClashEval are comparing it to the libraries listed below
Sorting:
- ☆14Oct 17, 2024Updated last year
- Open sourced result for The Agent Company☆22Nov 11, 2025Updated 3 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆81Apr 12, 2024Updated last year
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆29May 22, 2025Updated 8 months ago
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Feb 4, 2026Updated 2 weeks ago
- Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper☆29Jun 28, 2024Updated last year
- Rethinking the User Interface of AI☆28Feb 10, 2026Updated last week
- ☆35May 21, 2025Updated 8 months ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆30Apr 13, 2023Updated 2 years ago
- Towards Systematic Measurement for Long Text Quality☆37Sep 5, 2024Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆152Sep 21, 2024Updated last year
- ☆78Nov 19, 2024Updated last year
- Open Source Replication of Anthropic's Alignment Faking Paper☆54Apr 4, 2025Updated 10 months ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- ☆13Nov 5, 2024Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆189Sep 13, 2025Updated 5 months ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆13Nov 25, 2024Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- A sample implementation of login/registration with Cognito in React☆12Jun 23, 2023Updated 2 years ago
- ☆10Oct 20, 2020Updated 5 years ago
- ☆10Sep 23, 2020Updated 5 years ago
- An simplest PE parser, which list all import and export entries☆12Oct 11, 2018Updated 7 years ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- (NeurIPS 2024) One-shot Federated Learning via Synthetic Distiller-Distillate Communication☆13Mar 11, 2025Updated 11 months ago
- An SSH plugin for Dify☆12Jan 16, 2026Updated last month
- A platform for storing large semantic networks on MongoDB☆22Jun 20, 2011Updated 14 years ago
- LaTeX template of graduate Thesis [University of Chinese Academy of Sciences]☆12Nov 7, 2017Updated 8 years ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- my profile readme☆14Feb 10, 2026Updated last week
- Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)☆10Dec 16, 2018Updated 7 years ago
- ☆10Mar 5, 2024Updated last year
- ☆16May 13, 2021Updated 4 years ago
- 练习题,python 协同过滤ALS模型实现:商品推荐 + 用户人群放大☆10Jun 4, 2020Updated 5 years ago
- ☆18Jan 4, 2026Updated last month
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 6 months ago
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- GraphQL and Rest API rewrite of the current Open Targets platform API☆15Updated this week
- The course work repo for UoSurrey EEEM071 (2023 Spring)☆11May 9, 2023Updated 2 years ago