Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
☆35Jan 31, 2025Updated last year
Alternatives and similar repositories for RE-Control
Users that are interested in RE-Control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.…☆28Sep 25, 2024Updated last year
- Code for paper: End-to-end Stochastic Optimization with Energy-based Model☆16Feb 14, 2023Updated 3 years ago
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Jun 4, 2020Updated 5 years ago
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆575Jan 28, 2025Updated last year
- Lightweight Adapting for Black-Box Large Language Models☆25Feb 15, 2024Updated 2 years ago
- ☆11Nov 13, 2024Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆25Sep 19, 2024Updated last year
- [ICLR 2022] Official Code Repository for "TRGP: TRUST REGION GRADIENT PROJECTION FOR CONTINUAL LEARNING"☆22Oct 5, 2022Updated 3 years ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data☆36Nov 16, 2020Updated 5 years ago
- ☆17Apr 23, 2026Updated last week
- An exploration of LLM steering☆26Jun 15, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the code repository of our submission: Understanding the Dark Side of LLMs’ Intrinsic Self-Correction.☆61Dec 20, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- Source code of MOLLEO☆56Jul 8, 2025Updated 9 months ago
- CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation☆14Aug 19, 2025Updated 8 months ago
- ☆16Oct 18, 2024Updated last year
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆68Dec 10, 2024Updated last year
- Testing Difference Target Propagation (DTP) on MNIST.☆13Oct 12, 2020Updated 5 years ago
- ☆15Apr 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of CVPR 2025 paper "Invisible Backdoor Attack against Self-supervised Learning"☆17Jul 5, 2025Updated 10 months ago
- Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".☆12Jan 9, 2025Updated last year
- ☆13Jan 22, 2025Updated last year
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆27Feb 25, 2025Updated last year
- ☆11May 14, 2024Updated last year
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"☆145Mar 26, 2024Updated 2 years ago
- Uncover meaningful structures of latent spaces learned by generative models with flows!☆45May 10, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Evaluation of generated videos on the FETV benchmark☆10Apr 6, 2025Updated last year
- My personal site, using Wowchemy☆13Apr 24, 2026Updated last week
- [ICML2025] Test-Time Learning for Large Language Models☆55Jan 31, 2026Updated 3 months ago
- Injecting watermarks to protein sequences for privacy protection in biosecurity☆10Oct 1, 2024Updated last year
- An PyTorch reimplementation of bottom-up-attention models☆16Jan 5, 2021Updated 5 years ago
- PyTorch Implementation of Small Lesion Segmentation in Brain MRIs with Subpixel Embedding (ORAL, MICCAIW 2021)☆22Jul 1, 2022Updated 3 years ago
- Mirror of Apache Ode Jacob☆12Mar 8, 2018Updated 8 years ago