☆67Jan 28, 2026Updated 2 months ago
Alternatives and similar repositories for activation_oracles
Users that are interested in activation_oracles are comparing it to the libraries listed below.
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆69Mar 22, 2026Updated last week
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆27Oct 20, 2025Updated 5 months ago
- ☆105Oct 30, 2023Updated 2 years ago
- ☆14Mar 21, 2024Updated 2 years ago
- DropNet: Reducing Neural Network Complexity via Iterative Pruning (ICML 2020)☆16Aug 24, 2020Updated 5 years ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 3 months ago
- A study of ecosystem in Julia, as an alternative to Matlab☆12Jul 2, 2020Updated 5 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Code repo for the model organisms and convergent directions of EM papers.☆57Sep 22, 2025Updated 6 months ago
- "CCNLab: A Benchmarking Framework for Computational Cognitive Neuroscience" (NeurIPS 2021)☆10Jul 12, 2021Updated 4 years ago
- ☆10Mar 4, 2024Updated 2 years ago
- [ICML'25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated 11 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- CUDA implementation of Multidimensional Scaling☆15May 8, 2021Updated 4 years ago
- ☆33Jul 9, 2025Updated 8 months ago
- ☆11Jul 18, 2022Updated 3 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 10 months ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆20Mar 22, 2026Updated last week
- diffusers with search engine☆12Jan 13, 2026Updated 2 months ago
- Bridge Claude Code CLI with Feishu/Lark via WebSocket. Feishu × Claude Code real-time conversation.☆29Mar 22, 2026Updated last week
- A boilerplate for seamlessly integrating PyTorch's Distributed Data Parallel (DDP) with SLURM job scheduling and Weights and Biases. Kick…☆10Aug 21, 2023Updated 2 years ago
- An extension to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- ☆10Apr 5, 2022Updated 3 years ago
- Tools for optimizing steering vectors in LLMs.☆20Apr 10, 2025Updated 11 months ago
- A conda-smithy repository for memory_profiler.☆12Mar 16, 2026Updated last week
- ☆18Mar 30, 2025Updated 11 months ago
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"☆10Nov 21, 2024Updated last year
- 【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding☆32Mar 17, 2026Updated last week
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated last year
- Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition☆18Oct 6, 2024Updated last year
- arXiv fragment loader plugin for https://llm.datasette.io/☆18May 17, 2025Updated 10 months ago
- ☆18Dec 12, 2025Updated 3 months ago
- ☆16Dec 13, 2020Updated 5 years ago
- ☆18Aug 19, 2024Updated last year
- ☆24Oct 2, 2025Updated 5 months ago
- ☆13Oct 25, 2022Updated 3 years ago