jnward / monosemanticity-reproLinks

☆31

Alternatives and similar repositories for monosemanticity-repro

Users that are interested in monosemanticity-repro are comparing it to the libraries listed below

Sorting:

Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆242Updated 5 months ago
joshuacnf / Ctrl-G
☆88Updated 7 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
google-deepmind / mishax
☆136Updated 4 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88Updated 10 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆196Updated 2 months ago
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Updated last year
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆37Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 9 months ago
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 6 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
allenai / infinigram-api
☆76Updated this week
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆284Updated 5 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆146Updated 5 months ago
leap-laboratories / PIZZA
An attribution library for LLMs
☆42Updated 10 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 8 months ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆53Updated 6 months ago
NousResearch / finetuning-subnet
☆121Updated last year
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆88Updated 10 months ago
geronimi73 / qlora-minimal
☆86Updated last year
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆104Updated 5 months ago
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆65Updated last year
QuixiAI / spectrum
☆131Updated 4 months ago
NousResearch / StripedHyenaTrainer
☆61Updated last year
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆64Updated 9 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆104Updated 5 months ago