tomekkorbak / bliss-attractors
A toy Inspect implementation of the Bliss Attractor eval from the Claude 4 System Card welfare assessment
☆35 Updated 5 months ago
Alternatives and similar repositories for bliss-attractors
Users who are interested in bliss-attractors are comparing it to the repositories listed below:
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆32 Updated last month
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆62 Updated last year
- look how they massacred my boy ☆63 Updated last year
- ☆45 Updated 6 months ago
- explore token trajectory trees on instruct and base models ☆148 Updated 6 months ago
- Approximating the joint distribution of language models via MCTS ☆22 Updated last year
- ☆68 Updated 6 months ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated 2 years ago
- ☆14 Updated 7 months ago
- A graph visualization of attention ☆57 Updated 6 months ago
- anything you want can be built with morph cloud ☆25 Updated last month
- Plotting (entropy, varentropy) for small LMs ☆99 Updated 6 months ago
- Lego for GRPO ☆30 Updated 6 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*? ☆127 Updated last month
- ☆35 Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated last year
- Marketplace ML experiment - training without backprop ☆27 Updated 2 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆72 Updated 7 months ago
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM… ☆77 Updated 3 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated last year
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆99 Updated 4 months ago
- ☆107 Updated last month
- Project code for training LLMs to write better unit tests + code ☆21 Updated 6 months ago
- ☆117 Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆58 Updated last month
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 3 months ago
- Interactive timeline of AI history ☆63 Updated 2 months ago
- smolLM with Entropix sampler on pytorch ☆149 Updated last year
- ☆36 Updated 2 weeks ago