Training code for Sparse Autoencoders on Embedding models
☆39Feb 27, 2025Updated last year
Alternatives and similar repositories for latent-sae
Users that are interested in latent-sae are comparing it to the libraries listed below
Sorting:
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 11 months ago
- An introduction to LLM Sampling☆80Dec 15, 2024Updated last year
- ☆20Nov 18, 2024Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- User-friendly viewer for Parquet files☆10Updated this week
- MEXMA: Token-level objectives improve sentence representations☆43Jan 6, 2025Updated last year
- A collection of tools for your LLMs that run on Modal☆23Feb 28, 2025Updated last year
- Sparsify transformers with SAEs and transcoders☆699Mar 2, 2026Updated last week
- A toy text-to-image model trained from scratch.☆19Jun 9, 2025Updated 9 months ago
- Finetune your embeddings in-browser☆34Apr 14, 2024Updated last year
- ☆57Jan 26, 2025Updated last year
- ☆14Jul 7, 2024Updated last year
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 10 months ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval☆16Feb 13, 2026Updated 3 weeks ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- ☆26Oct 27, 2025Updated 4 months ago
- code for training & evaluating Contextual Document Embedding models☆201May 14, 2025Updated 9 months ago
- Training hybrid models for dummies.☆29Nov 1, 2025Updated 4 months ago
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- utilities for loading and running text embeddings with onnx☆45Aug 16, 2025Updated 6 months ago
- Pre-train Static Word Embeddings☆93Sep 9, 2025Updated 6 months ago
- A high performance batching router optimises max throughput for text inference workload☆16Sep 6, 2023Updated 2 years ago
- Multimodal extreme classification☆20May 1, 2024Updated last year
- A single static file as vector database, using the cloud-native flatgeobuf file format and http range requests☆17Oct 28, 2025Updated 4 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- Contextualized per-token embeddings☆34May 11, 2025Updated 9 months ago
- ☆22Mar 28, 2024Updated last year
- FormFill is a CLI tool that uses LLMs to automatically fill out PDF forms.☆29Nov 22, 2024Updated last year
- Late Interaction Models Training & Retrieval☆740Updated this week
- ☆47Mar 27, 2022Updated 3 years ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆49Nov 6, 2024Updated last year
- Simplified implementation of UMAP like dimensionality reduction algorithm☆53Nov 18, 2024Updated last year
- Fast reinforcement learning 💨☆28Jul 15, 2025Updated 7 months ago
- ☆27Aug 1, 2024Updated last year
- Visualization and sparse autoencoder training for mechanistic interpretability on audio models☆23Apr 6, 2025Updated 11 months ago
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 7 months ago
- ☆28Oct 7, 2025Updated 5 months ago