timfduffy / syco-bench
Benchmark to estimate model sycophancy
☆19 · Updated 5 months ago
Alternatives and similar repositories for syco-bench
Users who are interested in syco-bench are comparing it to the repositories listed below
- An API for simplifying X requests for a single authenticated account ☆24 · Updated 9 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆98 · Updated this week
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*? ☆126 · Updated 2 weeks ago
- Base mech ☆39 · Updated last week
- Modular Agentic AI Architecture - NousResearch x Teleport (Flashbots) ☆71 · Updated 9 months ago
- Sparse autoencoders for Contra text embedding models ☆25 · Updated last year
- hehe ☆77 · Updated 11 months ago
- A replication of Andy Ayrey's "Backrooms" (https://dreams-of-an-electric-mind.webflow.io/), but runnable with Opus 3, Sonnet 3.5, GPT 4o,… ☆109 · Updated 5 months ago
- A simulated operating system design for AI Agents to interact with the world ☆171 · Updated 9 months ago
- Private inference over your sensitive data with off-the-shelf models ☆35 · Updated 2 years ago
- Where AI Gets Real ☆20 · Updated this week
- Solidity contracts for the decentralized Prime Network protocol ☆27 · Updated 3 months ago
- Python SDK for FirstBatch: Real-time personalization using vectorDBs ☆17 · Updated last year
- ☆17 · Updated last year
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat… ☆47 · Updated 6 months ago
- ☆70 · Updated 2 weeks ago
- An AI agent that can predict the future. ☆49 · Updated 2 months ago
- A minimalist notepad for thinkers. Ephemeral notes for ephemeral thoughts. ☆45 · Updated 2 years ago
- summaries of ai research ☆46 · Updated 5 months ago
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale ☆126 · Updated 2 months ago
- An MCP (Model Context Protocol) server that provides Ethereum blockchain data tools via Etherscan's API. Features include checking ETH ba… ☆27 · Updated 9 months ago
- Data: Ecosystem news, GitHub updates, discussion summaries, and other useful bits for knowledge / RAG systems ☆45 · Updated last week
- cli loom that uses git to manage branches ☆27 · Updated 9 months ago
- train entropix like a champ! ☆20 · Updated last year
- Deertick Agent Management and Integration Toolbox (DAMIT) ☆22 · Updated 10 months ago
- A quickstart for the trader agent for AI prediction markets on Gnosis ☆75 · Updated 4 months ago
- ☆135 · Updated 6 months ago
- A framework for the creation of autonomous agent services. ☆106 · Updated this week
- command loom interface ☆110 · Updated 8 months ago
- AGI Guild ☆16 · Updated last year