☆87 Apr 18, 2026 Updated 2 months ago
Alternatives and similar repositories for activation_oracles
Users that are interested in activation_oracles are comparing it to the libraries listed below.
- Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition ☆18 Oct 6, 2024 Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs) ☆25 Dec 1, 2024 Updated last year
- ☆22 Aug 10, 2024 Updated last year
- This repository contains the code and data for the paper "SelfIE: Self-Interpretation of Large Language Model Embeddings" by Haozhe Chen,… ☆58 Dec 9, 2024 Updated last year
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆74 Apr 15, 2026 Updated 2 months ago
- ☆106 Oct 30, 2023 Updated 2 years ago
- ☆15 Mar 21, 2024 Updated 2 years ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M… ☆13 Nov 17, 2024 Updated last year
- Inverse Scaling in Test-Time Compute ☆25 Dec 3, 2025 Updated 6 months ago
- A study of ecosystem in Julia, as an alternative to Matlab ☆12 Jul 2, 2020 Updated 5 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… ☆16 Feb 4, 2025 Updated last year
- "CCNLab: A Benchmarking Framework for Computational Cognitive Neuroscience" (NeurIPS 2021) ☆10 Jul 12, 2021 Updated 4 years ago
- ☆10 Mar 4, 2024 Updated 2 years ago
- [ICML'25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an… ☆13 Apr 17, 2025 Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis ☆12 Nov 17, 2024 Updated last year
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts" ☆10 Jul 1, 2024 Updated last year
- ☆20 May 30, 2025 Updated last year
- CUDA implementation of Multidimensional Scaling ☆15 May 8, 2021 Updated 5 years ago
- ☆26 Aug 23, 2025 Updated 9 months ago
- ☆11 Jul 18, 2022 Updated 3 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆17 Mar 31, 2025 Updated last year
- Sparse Autoencoder Training Library ☆57 May 1, 2025 Updated last year
- ☆37 Jul 9, 2025 Updated 11 months ago
- Topic Embedding, Text Generation and Modeling using diffusion ☆15 Jun 10, 2026 Updated last week
- Source code and data of our paper "Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation" (https://arxiv.org/… ☆10 Jun 21, 2023 Updated 2 years ago
- diffusers with search engine ☆12 Jan 13, 2026 Updated 5 months ago
- ☆12 Feb 28, 2025 Updated last year
- Official Implementation of our ICML 2025 paper: "D-MoLE: Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction … ☆27 Jan 11, 2026 Updated 5 months ago
- A boilerplate for seamlessly integrating PyTorch's Distributed Data Parallel (DDP) with SLURM job scheduling and Weights and Biases. Kick… ☆10 Aug 21, 2023 Updated 2 years ago
- An extension to the GaLore paper, to perform Natural Gradient Descent in low rank subspace ☆19 Oct 21, 2024 Updated last year
- ☆10 Apr 5, 2022 Updated 4 years ago
- Tools for optimizing steering vectors in LLMs. ☆22 Apr 10, 2025 Updated last year
- Multi-Layer Sparse Autoencoders (ICLR 2025) ☆30 Feb 6, 2026 Updated 4 months ago
- A conda-smithy repository for memory_profiler. ☆12 Apr 22, 2026 Updated last month
- ☆18 Mar 30, 2025 Updated last year
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization" ☆10 Nov 21, 2024 Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 Jul 18, 2024 Updated last year
- ☆11 May 2, 2023 Updated 3 years ago
- A labeled dataset used for the knowledge graph construction. ☆35 Nov 30, 2023 Updated 2 years ago