haizelabs / bijection-learning
☆22Updated 5 months ago
Alternatives and similar repositories for bijection-learning:
Users that are interested in bijection-learning are comparing it to the libraries listed below
- Sphynx Hallucination Induction☆53Updated 2 months ago
- Verdict is a library for scaling judge-time compute.☆192Updated 2 weeks ago
- Red-Teaming Language Models with DSPy☆178Updated last month
- ⚖️ Awesome LLM Judges ⚖️☆87Updated last month
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆99Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆89Updated 9 months ago
- ☆124Updated last week
- ☆48Updated last year
- ☆53Updated 6 months ago
- ☆67Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 6 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆67Updated 9 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆85Updated this week
- ☆109Updated 2 weeks ago
- Repository for the paper Stream of Search: Learning to Search in Language☆142Updated 2 months ago
- An attribution library for LLMs☆38Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆57Updated 2 weeks ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆92Updated last month
- ☆80Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- ☆31Updated last week
- ☆87Updated 2 weeks ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- ☆50Updated 4 months ago
- Letting Claude Code develop his own MCP tools :)☆93Updated 3 weeks ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 5 months ago
- PyTorch library for Active Fine-Tuning☆62Updated last month
- ☆111Updated last month