Training code for Sparse Autoencoders on Embedding models
☆39 · Jun 16, 2026 · Updated this week
Alternatives and similar repositories for latent-sae
Users that are interested in latent-sae are comparing it to the libraries listed below.
- Using modal.com to process FineWeb-edu data ☆20 · Apr 11, 2026 · Updated 2 months ago
- A collection of tools for your LLMs that run on Modal ☆26 · Feb 28, 2025 · Updated last year
- Live-editable codeblocks for any language. ☆16 · Dec 13, 2025 · Updated 6 months ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- MEXMA: Token-level objectives improve sentence representations ☆43 · Jan 6, 2025 · Updated last year
- Sparsify transformers with SAEs and transcoders ☆727 · Updated this week
- ☆21 · Nov 18, 2024 · Updated last year
- A simple, yet comprehensive foundation for interacting with common cloud providers in Julia (GCP, Azure, AWS). ☆20 · Apr 27, 2026 · Updated last month
- Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval ☆17 · Feb 13, 2026 · Updated 4 months ago
- An introduction to LLM Sampling ☆80 · Dec 15, 2024 · Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆19 · Jul 24, 2025 · Updated 10 months ago
- Simple Transformer in Jax ☆144 · Jun 22, 2024 · Updated last year
- Find all variables referenced and assigned in an expression ☆17 · May 1, 2026 · Updated last month
- Finetune your embeddings in-browser ☆34 · Apr 14, 2024 · Updated 2 years ago
- RASP-L in Haskell for my fellow rascals ☆20 · Dec 3, 2023 · Updated 2 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation ☆16 · Oct 14, 2022 · Updated 3 years ago
- An agentic tool to configure Dockerfiles for any repo ☆76 · Apr 30, 2026 · Updated last month
- Sparse Autoencoder Training Library ☆57 · May 1, 2025 · Updated last year
- ☆63 · Jan 26, 2025 · Updated last year
- All-in-one RAG toolkit—from quick prototypes to advanced pipelines. ☆33 · Nov 27, 2025 · Updated 6 months ago
- A toy text-to-image model trained from scratch. ☆19 · Jun 9, 2025 · Updated last year
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year
- https://footprints.baulab.info ☆18 · Oct 4, 2024 · Updated last year
- ☆18 · Jun 19, 2023 · Updated 3 years ago
- User-friendly viewer for Parquet files ☆13 · May 8, 2026 · Updated last month
- Modal LLM LLama.cpp based model deployment as part of series of Model as a Service (MaaS) ☆17 · Mar 23, 2026 · Updated 2 months ago
- ☆12 · Dec 28, 2021 · Updated 4 years ago
- A sample pattern for running CI tests on Modal ☆19 · Apr 12, 2025 · Updated last year
- A webhook that integrates the W&B model registry with Modal Labs ☆15 · Dec 24, 2023 · Updated 2 years ago
- ☆24 · Jan 28, 2025 · Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Aug 24, 2024 · Updated last year
- Because it's there. ☆16 · Sep 22, 2024 · Updated last year
- Eden is building autonomous creative agents. ☆30 · May 28, 2026 · Updated 3 weeks ago
- utilities for loading and running text embeddings with onnx ☆45 · Aug 16, 2025 · Updated 10 months ago
- Orchestrate Modal and OpenAI workloads with Dagster ☆13 · Dec 11, 2024 · Updated last year
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 · Nov 6, 2024 · Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- code for training & evaluating Contextual Document Embedding models ☆205 · May 14, 2025 · Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 · May 3, 2022 · Updated 4 years ago