google-deepmind / personality_in_llmsLinks
☆20Updated last month
Alternatives and similar repositories for personality_in_llms
Users that are interested in personality_in_llms are comparing it to the libraries listed below
Sorting:
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- Red-Teaming Language Models with DSPy☆235Updated 9 months ago
- ☆79Updated last month
- Sphynx Hallucination Induction☆53Updated 9 months ago
- ☆180Updated last week
- hyperstitional latent seeding☆22Updated 8 months ago
- Training-Ready RL Environments + Evals☆174Updated this week
- Specification for creating reliable LLM-based conversational agents☆63Updated 3 weeks ago
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆463Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge.☆308Updated last week
- Letting Claude Code develop his own MCP tools :)☆123Updated 8 months ago
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆121Updated last year
- A system that tries to resolve all issues on a github repo with OpenHands.☆115Updated 11 months ago
- Simple demo showing how to use the Forge API by Nous Research☆14Updated last year
- Collection of evals for Inspect AI☆284Updated this week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆120Updated this week
- Keeping my personal experiments separate from the main repo☆69Updated 9 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆227Updated last year
- ☆80Updated last year
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆456Updated last year
- Open-source resources on agents for computer use.☆380Updated last month
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆25Updated 3 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]☆573Updated 3 months ago
- ☆83Updated last week
- A framework for generative software.☆114Updated 4 months ago
- A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating System…☆138Updated 6 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆92Updated last year
- Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.☆358Updated this week
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆281Updated 3 months ago