BullshitBench, created by Peter Gostev, measures whether AI models challenge nonsensical prompts instead of confidently answering them.
☆1,633 · May 20, 2026 · Updated this week
Alternatives and similar repositories for bullshit-benchmark
Users who are interested in bullshit-benchmark are comparing it to the libraries listed below.
- A lifeline for people dealing with Windows, especially after using macOS. ☆12 · Apr 24, 2022 · Updated 4 years ago
- Scrape reddit posts into a single markdown file ☆12 · Jul 28, 2024 · Updated last year
- What if salsa but tokio-friendly ☆52 · Updated this week
- ☆21 · Apr 23, 2025 · Updated last year
- Just some nice dice in Python ☆21 · Jan 6, 2026 · Updated 4 months ago
- Experimenting and exploring Computer Vision with Deep Learning ☆10 · Mar 29, 2025 · Updated last year
- Network for procedural editing of text with LLMs ☆23 · Apr 28, 2026 · Updated 3 weeks ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs. ☆43 · Apr 15, 2026 · Updated last month
- ☆18 · Oct 21, 2024 · Updated last year
- Simple and Ideal Circuit Simulation ☆13 · Dec 4, 2017 · Updated 8 years ago
- Pipeline for generating RNAseq-based cancer patient reports ☆12 · Apr 15, 2026 · Updated last month
- ☆17 · May 22, 2025 · Updated 11 months ago
- Pynocular is a lightweight ORM that lets you query your database using Pydantic models and asyncio ☆11 · May 24, 2022 · Updated 3 years ago
- A pytorch version of hamiltonian monte carlo ☆15 · Jun 26, 2019 · Updated 6 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate. ☆40 · Updated this week
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy. ☆11 · Feb 12, 2023 · Updated 3 years ago
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and de… ☆32 · May 11, 2026 · Updated last week
- Blacktie: a streamlined interface to the popular tophat/cufflinks RNA-seq pipeline ☆26 · Oct 5, 2015 · Updated 10 years ago
- An ORM-Like interface for Google Cloud NoSQL Datastore ☆13 · May 8, 2021 · Updated 5 years ago
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t… ☆14 · Jun 23, 2023 · Updated 2 years ago
- Revit MCP SDK ☆30 · Sep 13, 2025 · Updated 8 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis ☆28 · Feb 14, 2026 · Updated 3 months ago
- Sentry CLI ☆82 · Updated this week
- Structured output benchmarks comparing DSPy and BAML with different LLMs ☆28 · Dec 23, 2025 · Updated 4 months ago
- ☆21 · Aug 26, 2025 · Updated 8 months ago
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser ☆13 · Jan 14, 2026 · Updated 4 months ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj… ☆39 · Feb 20, 2026 · Updated 3 months ago
- Opinionated typing package for precise type hints in Python ☆82 · Updated this week
- various tools to download, convert and process the full text of scientific articles ☆10 · Apr 2, 2024 · Updated 2 years ago
- A nice keyboard-oriented homepage, designed by committee^Wspec. ☆13 · Jun 25, 2025 · Updated 10 months ago
- Recursive workflow for agentic engineering. Like Factory Missions but properly recursive, free and open source. ☆97 · May 13, 2026 · Updated last week
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code. ☆18 · May 7, 2024 · Updated 2 years ago
- CNOE CLI ☆10 · May 9, 2024 · Updated 2 years ago
- OpenAPI Processing API ☆23 · Jan 28, 2026 · Updated 3 months ago
- A fork of sqlite-utils with CLI etc removed ☆17 · Apr 28, 2026 · Updated 3 weeks ago
- a WIP architecture designed to allow transformers to think in a manner without tokens ☆20 · Apr 12, 2024 · Updated 2 years ago
- Channels between coroutines in Python ☆15 · Jan 4, 2021 · Updated 5 years ago
- Structured Generation Evals ☆14 · Sep 25, 2024 · Updated last year
- A fun PGM experience ☆15 · May 19, 2025 · Updated last year