jnward / monosemanticity-repro
Open source repro of "Towards Monosemanticity"
☆31Updated last year
Alternatives and similar repositories for monosemanticity-repro
Users who are interested in monosemanticity-repro are comparing it to the repositories listed below
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆246Updated 8 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 9 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 8 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆289Updated 7 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- ☆136Updated 2 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆72Updated last year
- ☆46Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated last year
- ☆103Updated 9 months ago
- A new benchmark for measuring LLMs' capability to detect bugs in large codebases.☆32Updated last year
- ☆119Updated last year
- ☆142Updated last month
- ☆136Updated last year
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- ☆67Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- code for training & evaluating Contextual Document Embedding models☆199Updated 5 months ago
- An introduction to LLM Sampling☆79Updated 10 months ago
- Approximation of the Claude 3 tokenizer by inspecting generation stream☆143Updated last year
- ☆162Updated 2 months ago
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- Evaluating LLMs with fewer examples☆164Updated last year
- Code for ExploreTom☆86Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆68Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆178Updated last year
- An attribution library for LLMs☆43Updated last year