mwatkins1970 / SAE_Feature_Interpretability_ToolLinks
A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom on Gemma-2B).
☆19Updated 9 months ago
Alternatives and similar repositories for SAE_Feature_Interpretability_Tool
Users that are interested in SAE_Feature_Interpretability_Tool are comparing it to the libraries listed below
Sorting:
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 7 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆89Updated 11 months ago
- ☆10Updated 2 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆117Updated this week
- The first dense retrieval model that can be prompted like an LM☆80Updated 2 months ago
- ☆38Updated 11 months ago
- XmodelLM☆39Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆54Updated 5 months ago
- ☆13Updated 7 months ago
- ☆63Updated 9 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 4 months ago
- alternative way to calculating self attention☆18Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 3 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆47Updated 10 months ago
- ☆52Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- ☆19Updated 4 months ago
- look how they massacred my boy☆63Updated 9 months ago
- entropix style sampling + GUI☆26Updated 8 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training☆37Updated 2 years ago
- Lego for GRPO☆28Updated last month
- Source code and utilities for the Genesys distributed language model architecture discovery system.☆40Updated 2 weeks ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- ☆66Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 10 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆88Updated 2 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 8 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆45Updated 5 months ago