mwatkins1970 / SAE_Feature_Interpretability_ToolLinks
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom on Gemma-2B).
☆19Updated 10 months ago
Alternatives and similar repositories for SAE_Feature_Interpretability_Tool
Users that are interested in SAE_Feature_Interpretability_Tool are comparing it to the libraries listed below
Sorting:
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆89Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- XmodelLM☆39Updated 8 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆33Updated 8 months ago
- The first dense retrieval model that can be prompted like an LM☆82Updated 3 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆45Updated 6 months ago
- Repository to create traveling waves integrate special information through time☆53Updated this week
- ☆13Updated 3 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated this week
- Using modal.com to process FineWeb-edu data☆20Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 9 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated 2 years ago
- ☆20Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated last year
- ☆11Updated last year
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Fork of Flame repo for training of some new stuff in development☆14Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…☆23Updated 8 months ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆29Updated last year
- Apps that run on modal.com☆12Updated last month
- KV Cache Steering for Inducing Reasoning in Small Language Models☆36Updated 2 weeks ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆47Updated 11 months ago
- Lego for GRPO☆28Updated 2 months ago
- BH hackathon☆14Updated last year
- ☆51Updated 6 months ago
- ☆63Updated 10 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year