mwatkins1970 / SAE_Feature_Interpretability_Tool

A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom on Gemma-2B).

☆19

Alternatives and similar repositories for SAE_Feature_Interpretability_Tool

Users who are interested in SAE_Feature_Interpretability_Tool are comparing it to the libraries listed below.

rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89 Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 Updated 8 months ago
XiaoduoAILab / XmodelLM
XmodelLM
☆39 Updated 8 months ago
rosewang2008 / posr
Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings
☆33 Updated 8 months ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆82 Updated 3 months ago
YutongWang1216 / DocMTAgent
Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
☆45 Updated 6 months ago
KempnerInstitute / traveling-waves-integrate
Repository for creating traveling waves that integrate spatial information through time
☆53 Updated this week
brendanhogan / completion_tree_view
☆13 Updated 3 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117 Updated this week
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 Updated 4 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66 Updated 9 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80 Updated last year
ANTONIOPSD / CaptionIMG
Simple program to manually caption your images (or any other file types) so you can use them for AI training
☆37 Updated 2 years ago
iulia-b10 / multilingual-embedding-models
☆20 Updated last year
giangdip2410 / HyperRouter
Code for the paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33 Updated last year
thu-spmi / PPT2DST
☆11 Updated last year
superagi / Veagle
Enhancement in Multimodal Representation Learning.
☆40 Updated last year
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆14 Updated 3 weeks ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55 Updated 6 months ago
Babelscape / LLM-Oasis
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆23 Updated 8 months ago
PootieT / explain-then-translate
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…
☆29 Updated last year
nateraw / modal-examples
Apps that run on modal.com
☆12 Updated last month
MaxBelitsky / cache-steering
KV Cache Steering for Inducing Reasoning in Small Language Models
☆36 Updated 2 weeks ago
MNoorFawi / curlora
The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.
☆47 Updated 11 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated 2 months ago
diicellman / dynamite-dogs
BH hackathon
☆14 Updated last year
LAGoM-NLP / transtokenizer
☆51 Updated 6 months ago
CERC-AAI / Robin
☆63 Updated 10 months ago
Zoeyyao27 / SirLLM
This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM
☆59 Updated last year
annahedstroem / sanity-checks-revisited
[NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"
☆25 Updated last year