mwatkins1970 / SAE_Feature_Interpretability_ToolView external linksLinks
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom on Gemma-2B).
☆19Oct 4, 2024Updated last year
Alternatives and similar repositories for SAE_Feature_Interpretability_Tool
Users that are interested in SAE_Feature_Interpretability_Tool are comparing it to the libraries listed below
Sorting:
- Code for Self-Assessed Generation and CVPR2024 PAPER ADFACTORY☆21Jul 28, 2025Updated 6 months ago
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆26Aug 23, 2025Updated 5 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆25Oct 15, 2025Updated 4 months ago
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning☆52Oct 17, 2025Updated 3 months ago
- SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation (AAAI24)☆25Jul 2, 2024Updated last year
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆56May 3, 2025Updated 9 months ago
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆14Apr 3, 2025Updated 10 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Nov 12, 2024Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 9 months ago
- Wonderful Matrices to Build Small Language Models☆44Feb 15, 2025Updated last year
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Dec 6, 2025Updated 2 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆59Feb 10, 2025Updated last year
- Official implementation of Inconsistency Masks. A robust semi-supervised segmentation framework that reframes model disagreement as a…☆19Jan 23, 2026Updated 3 weeks ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst…☆23Mar 16, 2025Updated 10 months ago
- ☆22Mar 23, 2025Updated 10 months ago
- ☆43May 6, 2024Updated last year
- The official implementation of Cross-Task Experience Sharing (COPS)☆29Oct 23, 2024Updated last year
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆56May 22, 2025Updated 8 months ago
- Experimental GPU language with meta-programming☆25Sep 6, 2024Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.☆25Jan 30, 2024Updated 2 years ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Dec 10, 2024Updated last year
- [ACL 2025] RetroLLM: Empowering LLMs to Retrieve Fine-grained Evidence within Generation☆118Jan 23, 2025Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Feb 10, 2025Updated last year
- Agile metrics tools allows you to track metrics from different sources in order to identify trends and patterns on how your team performa…☆11Jan 2, 2026Updated last month
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- Forecasting.☆37Aug 2, 2025Updated 6 months ago
- [ACL 2025 Long Main] Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions☆38Apr 21, 2025Updated 9 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Sep 30, 2024Updated last year
- ☆37Feb 8, 2026Updated last week
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆144Oct 13, 2025Updated 4 months ago
- Statewide Visual Geolocalization in the Wild (ECCV 2024)☆73Dec 2, 2024Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆44Feb 15, 2024Updated 2 years ago
- This is the pytorch implement of our paper "CCExpert: Advancing MLLM Capability in Remote Sensing Change Captioning with Difference-Aware…☆37Nov 20, 2024Updated last year
- Asimov helps you build high performance LLM apps, written in Rust 🦀☆11Jun 28, 2024Updated last year
- Structured Data Extractor for AI Agents. Search your documents or the web for specific data and get it back in JSON or Markdown in a sing…☆182Jan 5, 2026Updated last month
- [IJCAI'23] Complete Instances Mining for Weakly Supervised Instance Segmentation☆38Feb 14, 2024Updated 2 years ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆41May 20, 2024Updated last year