enjalot / latent-saeLinks

Training code for Sparse Autoencoders on Embedding models

☆38

Alternatives and similar repositories for latent-sae

Users that are interested in latent-sae are comparing it to the libraries listed below

Sorting:

Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
raphaelsty / LeNLP
NLP with Rust for Python 🦀🐍
☆64Updated 2 months ago
huggingface / wikirace-llms
☆23Updated 2 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 5 months ago
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated 11 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
xjdr-alt / muzero_sketch
☆38Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 5 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
joshuacnf / Ctrl-G
☆87Updated 6 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 5 months ago
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year
thomasnormal / fewshot
☆28Updated last month
allenai / infinigram-api
☆70Updated 2 weeks ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
strangeloopcanon / LLMRank
PageRank for LLMs
☆43Updated 3 months ago
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104Updated last year
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68Updated 3 months ago
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆85Updated 2 months ago
AnswerDotAI / fastkmeans
☆63Updated 3 weeks ago
KaiNylund / lm-weights-encode-time
☆69Updated 11 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆49Updated 3 months ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
allenai / adapt-demos
Lightweight tools for quick and easy LLM demo's
☆28Updated 10 months ago
euclaise / supertrainer2000
☆49Updated last year
arcee-ai / DAM
☆53Updated 8 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 10 months ago