thesephist / spectre

Sparse autoencoders for Contra text embedding models

☆25

Alternatives and similar repositories for spectre

Users who are interested in spectre are comparing it to the repositories listed below.

JD-P / RetroInstruct
Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search.
☆32Updated 5 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21Updated 9 months ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated 11 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆49Updated 8 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 3 months ago
teknium1 / transformers-gptq-quant
☆47Updated last year
xjdr-alt / muzero_sketch
☆38Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 10 months ago
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156Updated last year
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated last year
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations.
☆71Updated 5 months ago
Nearcyan / papers.day
papers.day
☆91Updated last year
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Updated last year
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆98Updated 2 months ago
MaximeRivest / funnydspy
Vanilla-Python ergonomics on top of DSPy
☆33Updated last month
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated 8 months ago
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆32Updated last year
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
joey00072 / Attention-as-graph
An alternative way of calculating self-attention
☆18Updated last year
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆177Updated 2 weeks ago
AblateIt / finetune-study
Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full fine-tunes.
☆82Updated last year
Algomancer / The-Daily-Train
Training Models Daily
☆17Updated last year
joshuacnf / Ctrl-G
☆87Updated 6 months ago
CG80499 / trlx-with-T5
[Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆47Updated 2 years ago
CyrusNuevoDia / llegos
A strongly typed Python DSL for developing message passing multi agent systems
☆53Updated last year
willccbb / localchat
☆14Updated 3 months ago