stalkermustang/llm-bulls-and-cows-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stalkermustang/llm-bulls-and-cows-benchmark)

stalkermustang / llm-bulls-and-cows-benchmark

A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.

☆234

Alternatives and similar repositories for llm-bulls-and-cows-benchmark

Users that are interested in llm-bulls-and-cows-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IlyaGusev / memetron3000
View on GitHub
LLM-based meme generator with templates
☆14Dec 1, 2025Updated 7 months ago
catalyst-team / bert
View on GitHub
A barebones (Distil)BERT pipeline for token classification tasks driven by catalyst
☆13Oct 14, 2019Updated 6 years ago
DenisSergeevitch / chatgpt-custom-instructions
View on GitHub
My own Prompts for Custom instructions ChatGPT
☆2,769Mar 14, 2026Updated 4 months ago
onixlas / DS_portfolio
View on GitHub
My DS projects
☆16Aug 6, 2025Updated 11 months ago
alxmamaev / sdsj-automl
View on GitHub
Sberbank Data Science Jorney Auto-ML competition
☆29Dec 26, 2018Updated 7 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
averkij / multipunct
View on GitHub
Train punctuation and capitalization models for different languages
☆26Apr 2, 2022Updated 4 years ago
vamplabAI / sgr-agent-core
View on GitHub
Schema-Guided Reasoning (SGR) has agentic system design created by neuraldeep community
☆1,112Updated this week
langswap-app / langswap
View on GitHub
Self-hosted AI video dubbing with ASR, translation, voice cloning, subtitles, and local GPU inference.
☆34Jun 22, 2026Updated last month
ternaus / base64ToImageConverters
View on GitHub
Library for converting from RGB / GrayScale image to base64 and back.
☆19Sep 19, 2022Updated 3 years ago
all-mute / yc-ai-tg-demo
View on GitHub
☆14Nov 29, 2024Updated last year
alxmamaev / theloop
View on GitHub
model-in-the-loop
☆42Aug 6, 2019Updated 6 years ago
burrsettles / readability
View on GitHub
Text readability metrics in Python.
☆11Aug 29, 2013Updated 12 years ago
aladin2907 / overhuman
View on GitHub
Self-evolving AI daemon in Go with fully generative UI. LLM generates unique HTML/ANSI interfaces from scratch for every response — not t…
☆17Apr 9, 2026Updated 3 months ago
Karthik-S-EC / Image-Quality-Assesment-Python-OpenCV
View on GitHub
No Reference BRISQUE technique to find the quality of an Image Using OpenCV
☆13Jul 21, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
stalkermustang / bcdc_ds_takehome
View on GitHub
Blockchain.com Data Scientist TakeHome (February 2022)
☆44Jan 16, 2023Updated 3 years ago
EvilFreelancer / openapi-to-mcp
View on GitHub
Turns any OpenAPI/Swagger API into an MCP server. One MCP tool per endpoint, Streamable HTTP - for AI clients calling your REST API.
☆16Feb 21, 2026Updated 5 months ago
princeton-nlp / InstructEval
View on GitHub
[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.
☆24Jul 26, 2023Updated 2 years ago
ImperialCollegeLondon / lean-maths-examples
View on GitHub
Some theorems presented to first and second year mathematics undergraduates, First and second year undergraduate level mathematics
☆13Jan 16, 2022Updated 4 years ago
mabotkin / zpordle
View on GitHub
A number guessing game with a p-adic twist.
☆12Oct 13, 2023Updated 2 years ago
trustbit / RAGathon
View on GitHub
☆97Oct 3, 2024Updated last year
Vilin97 / linear-algebra-done-right
View on GitHub
☆12Jun 30, 2022Updated 4 years ago
hse-cs / probaforms
View on GitHub
Conditional normalizing flows (NFs), conditional GANs, and conditional variational autoencoders (CVAEs) with sklearn-like interface
☆29Sep 5, 2024Updated last year
MindSetLib / Insolver
View on GitHub
Low code machine learning library, specified for insurance tasks: prepare data, build model, implement into production.
☆19Jan 21, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
KPEKEP / universal-llm-chatbot
View on GitHub
Universal LLM Telegram chatbot in Python
☆17Aug 16, 2024Updated last year
yandex-ai-studio / yandex-ai-studio-api-examples
View on GitHub
☆17Apr 11, 2026Updated 3 months ago
MyLtYkRiTiK / ComputerVision_Tutorials_in_Russian
View on GitHub
☆90Aug 30, 2019Updated 6 years ago
uw-math-ai / TheoremSearch
View on GitHub
Semantic Search Over 9 million Mathematical Theorems
☆20Updated this week
Yorko / huggingface_text2image_yorko
View on GitHub
HuggingFace entry exercise by Yury Kashnitsky
☆14Aug 25, 2023Updated 2 years ago
natasha / navec
View on GitHub
Compact high quality word embeddings for Russian language
☆218Apr 13, 2026Updated 3 months ago
dengyang17 / PACIFIC
View on GitHub
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance
☆14May 15, 2024Updated 2 years ago
vishakhpk / verify_citations
View on GitHub
Code to verify citations in a bibtex file
☆15Mar 14, 2026Updated 4 months ago
deep-spin / sparse_continuous_distributions
View on GitHub
This repository provides open-source code for sparse continuous distributions and corresponding Fenchel-Young losses.
☆15May 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
trustbit / erc3-agents
View on GitHub
Sample agents for Enterprise RAG Challenge 3: AI Agents in Action
☆47Dec 9, 2025Updated 7 months ago
sergebulaev / telegram-channel-saver
View on GitHub
Access, download and browse telegram group members and content.
☆113Jan 3, 2026Updated 6 months ago
taherfattahi / recommendation-systems-by-llms
View on GitHub
Enhancing Recommendation Systems with Large Language Models (RAG - LangChain - OpenAI)
☆42Dec 28, 2024Updated last year
nizhib / portrait-demo
View on GitHub
Portrait Segmentation Demo
☆43Jul 9, 2025Updated last year
AbdualimovTP / nona
View on GitHub
library for filling in missing values using artificial intelligence methods
☆18Jan 8, 2023Updated 3 years ago
alex000kim / latex_resume_template
View on GitHub
Minimalist Latex Resume Template
☆66May 22, 2021Updated 5 years ago
vyomakesh09 / longagent
View on GitHub
LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration
☆11Mar 11, 2024Updated 2 years ago