☆15Feb 5, 2025Updated last year
Alternatives and similar repositories for sycophancy-interpretability
Users that are interested in sycophancy-interpretability are comparing it to the libraries listed below.
- Read emotion with a line of code 🎭☆18Jan 2, 2025Updated last year
- CUPCase: Clinically Uncommon Patient Cases and Diagnoses Dataset☆14Oct 12, 2025Updated 5 months ago
- ☆45Jan 4, 2022Updated 4 years ago
- console.log for your stdio MCP server☆23Apr 1, 2025Updated 11 months ago
- linux && windows compatible caffe☆13Dec 4, 2019Updated 6 years ago
- A CNN feature based image retrieval website☆15May 16, 2017Updated 8 years ago
- ☆13Feb 3, 2021Updated 5 years ago
- 🌿 Quickly generate a folder directory structure; supports defining the directory depth and writing the output to a markdown file.☆13Oct 19, 2022Updated 3 years ago
- Pushing CIFAR-10 SOTA using ResNets.☆17Oct 17, 2025Updated 5 months ago
- ☆14May 7, 2022Updated 3 years ago
- ☆30Apr 26, 2025Updated 11 months ago
- NAEP Math Assessment Item Score Prediction Challenge (Spring 2023)☆15Jun 8, 2023Updated 2 years ago
- Official code for PLoP☆18Mar 6, 2026Updated 3 weeks ago
- Code for "Adversarial Defense by Stratified Convolutional Sparse Coding"☆19Jul 27, 2019Updated 6 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 9 months ago
- An analysis of which factors best predict the spread of forest fires using data from Portugal and California.☆159Aug 3, 2025Updated 7 months ago
- Python wrapper to extract CNN embeddings with Oxford VGG models using caffe.☆27May 13, 2015Updated 10 years ago
- ☆23Oct 27, 2023Updated 2 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- Tutorials that take an in depth look at how to view and manipulate DICOM images and how to get them ready for machine learning☆27Apr 12, 2023Updated 2 years ago
- ☆26Jan 23, 2024Updated 2 years ago
- alibabacloud-quantization-networks☆121Nov 8, 2019Updated 6 years ago
- Code and Data for WWW'23 paper Reinforcement Learning-based Counter-Misinformation Response Generation: A Case Study of COVID-19 Vaccine …☆27Jun 28, 2023Updated 2 years ago
- Official implementation of Visco-Attack (EMNLP 2025 Main). We will progressively release the code and one-click reproduction scripts.☆30Aug 22, 2025Updated 7 months ago
- Model Selection with Large Language Models for Reasoning (EMNLP2023 Findings)☆30Dec 23, 2023Updated 2 years ago
- Python client for the Human Brain Project Neuromorphic Computing Platform☆62Aug 27, 2025Updated 7 months ago
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆20Dec 6, 2024Updated last year
- Kicad Library Files for an 0.91" 128x32 OLED Display☆53Nov 26, 2025Updated 4 months ago
- GestureSesh is a free and open-source application that displays image files based on a user defined schedule. After images are selected,…☆51Mar 5, 2026Updated 3 weeks ago
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 6 months ago
- ☆23Mar 21, 2025Updated last year
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆22Apr 26, 2025Updated 11 months ago
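- A desktop pet of the Asoul girl group! This project won second place and the Most Popular award at the ByteDance x Juejin 2022 Programming Challenge☆16Jun 19, 2022Updated 3 years ago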
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- Official repository for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)☆23Jul 9, 2024Updated last year
- Socket Testing Assistant: a tool for testing socket sending and receiving of data on Mac OS X☆59Jan 17, 2011Updated 15 years ago
- tutorials☆22Aug 12, 2022Updated 3 years ago
- An implementation of SEAL: Safety-Enhanced Aligned LLM fine-tuning via bilevel data selection.☆24Feb 20, 2025Updated last year
- Code for 'CausalAdv: Adversarial Robustness Through the Lens of Causality'☆43Jan 19, 2024Updated 2 years ago