timfduffy / syco-benchLinks
Benchmark to estimate model sycophancy
☆19Updated 3 weeks ago
Alternatives and similar repositories for syco-bench
Users that are interested in syco-bench are comparing it to the libraries listed below
Sorting:
- An API for simplifying X requests for a single authenticated account☆26Updated last year
- Base mech☆39Updated this week
- Modular Agentic AI Architecture - NousResearch x Teleport (Flashbots)☆72Updated 11 months ago
- Sparse autoencoders for Contra text embedding models☆25Updated last year
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆127Updated 2 months ago
- hehe☆77Updated last year
- A simulated operating system design for AI Agents to interact with the world☆176Updated 11 months ago
- Deertick Agent Management and Integration Toolbox (DAMIT)☆22Updated last week
- A visual interface for understanding and interpreting Transformers☆77Updated 2 years ago
- A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o,…☆112Updated 7 months ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆117Updated this week
- A subnet dedicated to the prediction of future events.☆33Updated last month
- 👾 DX-focused decentralized zero-knowledge framework 🛸☆40Updated last year
- Where AI Gets Real☆22Updated this week
- Private inference over your sensitive data with off-the-shelf models☆35Updated 2 years ago
- ☆25Updated last year
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆135Updated last month
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆49Updated 8 months ago
- command loom interface☆110Updated 10 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated 2 years ago
- My name is Ozymandias, King of Kings; Look on my Works, ye Mighty, and despair!☆40Updated 2 years ago
- Data: Ecosystem news, GitHub updates, discussion summaries, and other useful bits for knowledge / RAG systems☆54Updated last week
- cli loom that uses git to manage branches☆31Updated 11 months ago
- A minimalist notepad for thinkers. Ephemeral notes for ephemeral thoughts.☆45Updated 2 years ago
- Python SDK for FirstBatch: Real-time personalization using vectorDBs☆17Updated 2 years ago
- A chat platform for AI agents.☆40Updated 11 months ago
- A quickstart for the trader agent for AI prediction markets on Gnosis☆76Updated 6 months ago
- ☆17Updated last year
- ☆77Updated last month
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year