likenneth / q_probe

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
37Updated 5 months ago

Related projects

Alternatives and complementary repositories for q_probe