mwatkins1970 / SAE_Feature_Interpretability_Tool
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom on Gemma-2B).
☆19Updated 5 months ago
Alternatives and similar repositories for SAE_Feature_Interpretability_Tool:
Users that are interested in SAE_Feature_Interpretability_Tool are comparing it to the libraries listed below
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆30Updated 4 months ago
- ☆13Updated 3 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 3 weeks ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆38Updated last month
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆28Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated 8 months ago
- The first dense retrieval model that can be prompted like an LM☆67Updated 6 months ago
- ☆32Updated 3 weeks ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated last month
- ☆16Updated 3 weeks ago
- Latent Large Language Models☆17Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆15Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 8 months ago
- ☆15Updated 6 months ago
- XmodelLM☆39Updated 4 months ago
- BH hackathon☆14Updated 11 months ago
- Training hybrid models for dummies.☆20Updated 2 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆28Updated last month
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- ☆42Updated 2 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆31Updated last month
- ☆38Updated last month
- alternative way to calculating self attention☆18Updated 10 months ago
- ☆51Updated 4 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 7 months ago
- ☆20Updated 3 weeks ago