Code for steering and monitoring with concepts vectors in LLMs. https://arxiv.org/abs/2502.03708
☆29Aug 10, 2025Updated 8 months ago
Alternatives and similar repositories for neural_controllers
Users that are interested in neural_controllers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆65Apr 12, 2025Updated last year
- ☆19Feb 19, 2024Updated 2 years ago
- EigenPro Iteration in PyTorch☆19Jan 9, 2024Updated 2 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated last year
- computation of convolutional kernels (CKN and NTK) in C++☆14Dec 13, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for verifying deep neural feature ansatz☆22May 3, 2023Updated 3 years ago
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- ☆36Feb 8, 2026Updated 2 months ago
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆24Jul 26, 2024Updated last year
- ☆10Dec 17, 2019Updated 6 years ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 4 years ago
- Computing various measures and generalization bounds on convolutional and fully connected networks☆35Dec 13, 2018Updated 7 years ago
- Implementation of "Towards Understanding Mixture of Experts in Deep Learning", NeurIPS 2022☆10Jan 6, 2023Updated 3 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fork of diux-dev/imagenet18☆16Oct 4, 2018Updated 7 years ago
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 8 months ago
- Official implementation of "Multi-scale Feature Learning Dynamics: Insights for Double Descent".☆17Jun 10, 2022Updated 3 years ago
- ☆18Jun 9, 2021Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- ☆22Apr 22, 2024Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- ☆13Jul 15, 2024Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆16May 13, 2025Updated 11 months ago
- ☆13Jul 26, 2021Updated 4 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Exploring the minimal architecture required for coherent English language generation.☆13Apr 22, 2026Updated last week
- ☆75Dec 7, 2024Updated last year
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆16Jan 7, 2025Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tensorflow-keras implementation for Contrastive Reconstruction (ConRec) : a self-supervised learning algorithm that obtains image represe…☆13Feb 22, 2022Updated 4 years ago
- Repository for the NeurIPS 2023 paper "Beyond Confidence: Reliable Models Should Also Consider Atypicality"☆13Apr 21, 2024Updated 2 years ago
- [AAAI 21] Utilizing meta-learning to correct the noisy labels.☆15Apr 26, 2021Updated 5 years ago
- ☆15Aug 7, 2021Updated 4 years ago
- ☆14Dec 12, 2024Updated last year
- Tutorials for getting the most out of Matplotlib☆23Nov 17, 2020Updated 5 years ago
- ☆12Oct 3, 2018Updated 7 years ago