BullshitBench measures whether AI models challenge nonsensical prompts instead of confidently answering them, created by Peter Gostev.
☆1,732Jun 24, 2026Updated this week
Alternatives and similar repositories for bullshit-benchmark
Users that are interested in bullshit-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A lifeline for people dealing with Windows, especially after using macOS.☆12Apr 24, 2022Updated 4 years ago
- Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing rout…☆116Mar 31, 2026Updated 3 months ago
- Scrape reddit posts into a single markdown file☆12Jul 28, 2024Updated last year
- What if salsa but tokio-friendly☆51Jun 18, 2026Updated last week
- ☆21Apr 23, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- a tiny workbench for parsing, visualizing & analyzing linker map files.☆37Oct 4, 2025Updated 8 months ago
- ☆17Apr 6, 2023Updated 3 years ago
- Network for procedural editing of text with LLMs☆23Apr 28, 2026Updated 2 months ago
- Stolemojis never die. A collection of Slack emojis from past, present, and future companies.☆10Feb 5, 2026Updated 4 months ago
- ☆18Oct 21, 2024Updated last year
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 8 months ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆102Mar 6, 2026Updated 3 months ago
- Enhanced version of binaryninja-ollama and without using the ollama Python library☆13Jan 23, 2025Updated last year
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.☆65Mar 16, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An experimental and alternative approach to Finetuning and RAG.☆34Dec 9, 2023Updated 2 years ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Pipeline for generating RNAseq-based cancer patient reports☆13Jun 11, 2026Updated 2 weeks ago
- ☆17May 22, 2025Updated last year
- CSS injection requires an attacker to load a standalone CSS file to leak HTML tag attributes.☆21Apr 19, 2024Updated 2 years ago
- Simple, flexible, interactive & powerful charts, maps and gauges for .Net☆17Nov 27, 2023Updated 2 years ago
- A pytorch version of hamiltonian monte carlo☆15Jun 26, 2019Updated 7 years ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆20Jun 28, 2025Updated last year
- textwrap.dedent with t-string support☆24Dec 15, 2025Updated 6 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is a read-only mirror of the CRAN R package repository. colorspace — A Toolbox for Manipulating and Assessing Colors and Palettes.…☆12Sep 22, 2025Updated 9 months ago
- A collection of molecular modelling tools for UCSF Chimera☆18Mar 26, 2019Updated 7 years ago
- A tutorial to build a Mesos framework that launches a web server in Go☆10May 27, 2016Updated 10 years ago
- A framework for evaluating function calls made by LLMs☆41Jul 23, 2024Updated last year
- ☆18Mar 18, 2024Updated 2 years ago
- Evaluation framework for document processing models and services.☆76May 28, 2026Updated last month
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Jun 23, 2023Updated 3 years ago
- Font subsetting for Rust (C++ present only for woff2 compression)☆40Jun 18, 2026Updated last week
- Semantic analysis engine for detecting vulnerability fixes in Windows kernel driver patches — 58 YAML rules, Ghidra decompilation, reacha…☆63Feb 26, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Vecty + three.js = ♡☆11Aug 26, 2018Updated 7 years ago
- Sentry CLI☆89Updated this week
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated 4 months ago
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆37Apr 3, 2023Updated 3 years ago
- ☆55Updated this week
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡☆17Apr 7, 2025Updated last year
- A collection of Python agent samples built with the Google Agent Development Kit (ADK), demonstrating integrations with services like B…☆21May 8, 2026Updated last month