haizelabs / sphynx
Sphynx Hallucination Induction
☆52 · Updated 3 weeks ago
Alternatives and similar repositories for sphynx
Users interested in sphynx are comparing it to the libraries listed below:
- Red-Teaming Language Models with DSPy ☆168 · Updated last week
- ☆20 · Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs ☆119 · Updated 6 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆78 · Updated this week
- ☆48 · Updated last year
- An attribution library for LLMs ☆37 · Updated 5 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆21 · Updated last month
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆28 · Updated this week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆70 · Updated 3 weeks ago
- Functional Benchmarks and the Reasoning Gap ☆82 · Updated 4 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆89 · Updated 8 months ago
- ☆61 · Updated 3 weeks ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆71 · Updated 4 months ago
- Small, simple agent task environments for training and evaluation ☆18 · Updated 3 months ago
- Synthetic Data for LLM Fine-Tuning ☆111 · Updated last year
- ☆50 · Updated 2 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 · Updated 6 months ago
- Routing on Random Forest (RoRF) ☆114 · Updated 4 months ago
- Chat Markup Language conversation library ☆55 · Updated last year
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod ☆17 · Updated 8 months ago
- A framework-less approach to robust agent development. ☆154 · Updated this week
- Track the progress of LLM context utilisation ☆53 · Updated 7 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆48 · Updated 4 months ago