☆31 · Nov 7, 2024 · Updated last year
Alternatives and similar repositories for Truth_is_Universal
Users who are interested in Truth_is_Universal are comparing it to the repositories listed below
- ☆100 · Aug 8, 2024 · Updated last year
- ☆28 · Nov 16, 2025 · Updated 3 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆71 · Jun 19, 2024 · Updated last year
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projections ☆13 · Nov 1, 2024 · Updated last year
- Digital Innovation Festival React Typescript workshop ☆10 · Jan 6, 2023 · Updated 3 years ago
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals ☆12 · May 24, 2024 · Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- ☆15 · Aug 19, 2025 · Updated 6 months ago
- ☆13 · Apr 10, 2025 · Updated 10 months ago
- Template for Python-based data science projects in the Alexandra Institute. ☆12 · Feb 15, 2026 · Updated 2 weeks ago
- Official Implementation of "The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thou… ☆14 · Jul 2, 2025 · Updated 8 months ago
- ☆15 · Mar 13, 2025 · Updated 11 months ago
- Math evaluations of llama models. ☆10 · Jan 3, 2024 · Updated 2 years ago
- Jupyter notebooks for cloud-based usage ☆10 · Aug 26, 2023 · Updated 2 years ago
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models ☆10 · Feb 20, 2025 · Updated last year
- Improving transparency of large language models' reasoning ☆14 · Nov 25, 2025 · Updated 3 months ago
- ☆14 · Mar 15, 2025 · Updated 11 months ago
- ☆11 · Mar 21, 2024 · Updated last year
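- A remote-control framework for GTNH-AE2 based on OpenComputers, nicknamed "GTNH Cyber Overseer", with support for placing orders remotely via a web page ☆17 · Jan 1, 2026 · Updated 2 months ago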
- A series of BERT and Albert model checkpoints trained to reduce gendered correlations in pre-training ☆11 · Oct 22, 2020 · Updated 5 years ago
- Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 … ☆10 · May 20, 2023 · Updated 2 years ago
- Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data" ☆17 · Jul 27, 2024 · Updated last year
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons ☆13 · Feb 13, 2023 · Updated 3 years ago
- The Regularization Cookbook, published by Packt ☆16 · Updated this week
- DISSECT: Disentangled Simultaneous Explanations via Concept Traversals ☆12 · Feb 5, 2024 · Updated 2 years ago
- ☆10 · May 26, 2020 · Updated 5 years ago
- This is the official PyTorch implementation for the paper: "Directed Acyclic Graph Factorization Machines for CTR Prediction via Knowledg… ☆14 · Mar 5, 2023 · Updated 3 years ago
- ☆11 · May 27, 2023 · Updated 2 years ago
- Localization of Knowledge in Text-to-Image Models ☆12 · Oct 8, 2024 · Updated last year
- 'Robust Semantic Interpretability: Revisiting Concept Activation Vectors' Official Implementation ☆11 · Jul 15, 2020 · Updated 5 years ago
- Code for "Investigating and Simplifying Masking-based Saliency Methods for Model Interpretability" (https://arxiv.org/abs/2010.09750) ☆14 · Nov 10, 2020 · Updated 5 years ago
- Discriminative Feature Selection via A Structured Sparse Subspace Learning Module ☆12 · Apr 15, 2022 · Updated 3 years ago
- Open Finnish NLP datasets ☆14 · Jan 3, 2025 · Updated last year
- ☆12 · Mar 19, 2021 · Updated 4 years ago
- Establishing Quantified Uncertainty in Neural Networks ☆15 · Jan 14, 2026 · Updated last month
- ☆12 · Jun 12, 2023 · Updated 2 years ago
- A tiny easily hackable implementation of a feature dashboard. ☆15 · Oct 21, 2025 · Updated 4 months ago
- Advanced Android MOOC - OpenGL Example ☆14 · Apr 8, 2019 · Updated 6 years ago
- ☆13 · Nov 17, 2024 · Updated last year