Jiaxin-Wen/Unsupervised-Elicitation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jiaxin-Wen/Unsupervised-Elicitation)

Jiaxin-Wen / Unsupervised-Elicitation

☆41

Alternatives and similar repositories for Unsupervised-Elicitation

Users that are interested in Unsupervised-Elicitation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

safety-research / SHADE-Arena
View on GitHub
☆26Jun 22, 2025Updated last year
lingo-mit / lm-truthfulness
View on GitHub
☆17Dec 21, 2023Updated 2 years ago
Jiaxin-Wen / GDsuite
View on GitHub
A toy eval suite for tracing generalization dynamics of LM pre-training
☆19May 19, 2026Updated 2 months ago
rgreenblatt / model_organism_public
View on GitHub
☆15Jun 17, 2025Updated last year
codelion / icm
View on GitHub
Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs
☆27Sep 5, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
EleutherAI / deep-ignorance
View on GitHub
☆20Jan 7, 2026Updated 6 months ago
shauli-ravfogel / adv-kernel-removal
View on GitHub
☆12Oct 23, 2022Updated 3 years ago
zroe1 / xlab-ai-security
View on GitHub
An online AI security course created by UChicago's XLab
☆38Feb 21, 2026Updated 5 months ago
safety-research / open-source-alignment-faking
View on GitHub
Open Source Replication of Anthropic's Alignment Faking Paper
☆58Apr 4, 2025Updated last year
annahdo / implementing_activation_steering
View on GitHub
A collection of different ways to implement accessing and modifying internal model activations for LLMs
☆24Oct 18, 2024Updated last year
Responsible-Dataset-Sharing / easy-dataset-share
View on GitHub
A CLI tool that helps AI researchers share datasets responsibly.
☆22Sep 15, 2025Updated 10 months ago
JunsolKim / RepresentationPoliticalLLM
View on GitHub
Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.
☆25Mar 27, 2025Updated last year
EleutherAI / elk-generalization
View on GitHub
Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…
☆33May 23, 2024Updated 2 years ago
jkutaso / SHADE-Arena
View on GitHub
☆57May 9, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
fangqin0703 / cs-7643-deep-learning
View on GitHub
Code for assignments of the graduate course CS 7643: Deep Learning offered at Georgia Tech in Fall 2018.
☆10Dec 23, 2018Updated 7 years ago
cydu24 / HER
View on GitHub
☆23Jan 30, 2026Updated 5 months ago
johnyang101 / reticular-sae
View on GitHub
Official repo of "Towards Interpretable Protein Structure Prediction with Sparse Autoencoders" published at ICLR 2025 GEM workshop.
☆17Mar 13, 2025Updated last year
shangshang-wang / Resa
View on GitHub
Resa: Transparent Reasoning Models via SAEs
☆50Sep 23, 2025Updated 10 months ago
54rt1n / shardmerge
View on GitHub
Using fourier interpolation to merge large language models
☆11Jul 11, 2026Updated 2 weeks ago
TeunvdWeij / sandbagging
View on GitHub
☆21Nov 15, 2024Updated last year
PPKFS / roguefunctor
View on GitHub
A Haskell roguelike toolkit
☆12Jul 10, 2025Updated last year
PKU-Alignment / aligner
View on GitHub
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
☆194Jan 16, 2025Updated last year
tatsu-lab / linguistic_calibration
View on GitHub
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆30Jun 4, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pphuc25 / face-recognition-convnext
View on GitHub
😉 Face Recognition using Convnext model + Flask
☆18Aug 25, 2023Updated 2 years ago
kreimanlab / WhenPigsFlyContext
View on GitHub
☆18May 14, 2022Updated 4 years ago
harvard-visionlab / open_ipcl
View on GitHub
official repository for the Instance Prototype Contrastive Learning (IPCL)
☆18Jun 20, 2022Updated 4 years ago
rgreenblatt / control-evaluations
View on GitHub
☆25May 25, 2024Updated 2 years ago
HazyResearch / correct-n-contrast
View on GitHub
Official code repository for Correct-N-Contrast
☆22Jul 18, 2022Updated 4 years ago
awwang10 / llmpromptboosting
View on GitHub
Accompanying code for "Boosted Prompt Ensembles for Large Language Models"
☆31Apr 13, 2023Updated 3 years ago
huanranchen / LLMLandscape
View on GitHub
The loss landscape of Large Language Models resemble basin!
☆41Jul 8, 2025Updated last year
longday1102 / VietAI-experiment-LLaMA2
View on GitHub
⚡ LLaMA-2 model experiment
☆12Nov 22, 2023Updated 2 years ago
zzzace2000 / robust_cls_model
View on GitHub
The code to reproduce CVPR 2021 paper "Towards Robust Classification Model by Counterfactual and Invariant Data Generation"
☆16Jul 29, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Newbeeer / TRM
View on GitHub
Learning Representations that Support Robust Transfer of Predictors
☆20Nov 7, 2021Updated 4 years ago
sunblaze-ucb / Intuitor
View on GitHub
[ICLR 2026] Learning to Reason without External Rewards
☆420Jan 26, 2026Updated 6 months ago
erobic / ramen
View on GitHub
This is a pytorch implementation of our Recurrent Aggregation of Multimodal Embeddings Network (RAMEN) from our CVPR-2019 paper.
☆17Apr 5, 2020Updated 6 years ago
Geralt-Targaryen / MC-Evaluation
View on GitHub
☆14May 21, 2024Updated 2 years ago
kanishkg / stream-of-search
View on GitHub
Repository for the paper Stream of Search: Learning to Search in Language
☆154Feb 3, 2025Updated last year
edenbiran / HoppingTooLate
View on GitHub
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆33Mar 2, 2025Updated last year
bsorsch / geometry-fewshot-learning
View on GitHub
☆19Apr 19, 2022Updated 4 years ago