stalkermustang / llm-bulls-and-cows-benchmark
A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.
☆238Jan 31, 2025Updated last year
Alternatives and similar repositories for llm-bulls-and-cows-benchmark
Users who are interested in llm-bulls-and-cows-benchmark are comparing it to the libraries listed below
- ☆680May 11, 2025Updated 9 months ago
- My DS projects☆15Aug 6, 2025Updated 6 months ago
- Library for converting from RGB / GrayScale image to base64 and back.☆19Sep 19, 2022Updated 3 years ago
- Deep-Learning for Tidemark Segmentation in Human Osteochondral Tissues Imaged with Micro-computed Tomography☆43Aug 30, 2019Updated 6 years ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Jul 26, 2023Updated 2 years ago
- This is an open-source implementation of MixedAE (https://arxiv.org/pdf/2303.17152.pdf)☆22Feb 14, 2025Updated last year
- PyTorch Implementation of the state-of-the-art model for object detection EfficientDet [pre-trained weights provided]☆22Oct 3, 2023Updated 2 years ago
- A benchmark comparing Russian analogues of ChatGPT: Saiga, YandexGPT, Gigachat☆61Sep 26, 2023Updated 2 years ago
- Phishing scams are more rampant than ever — and I wanted to do something about it. Over the last few weeks, I’ve been working on a proje…☆17Sep 8, 2025Updated 5 months ago
- ComfyUI custom node to extend Wan videos in loops with overlap consistency, per loop prompts, and optional LoRA control.☆25Nov 29, 2025Updated 2 months ago
- Enhancing Recommendation Systems with Large Language Models (RAG - LangChain - OpenAI)☆39Dec 28, 2024Updated last year
- Examples of distributed machine learning using the AICloud service☆37Nov 18, 2025Updated 2 months ago
- ConfigParser class with AES-256 symmetric encryption support☆10Mar 8, 2024Updated last year
- CLI app / pre-commit hook to clean Jupyter Notebooks metadata, execution_count and optionally output.☆11Mar 3, 2025Updated 11 months ago
- A Python CLI tool that performs lossless removal of ALL watermarks and metadata from MP3 and WAV audio files.☆30Dec 14, 2025Updated 2 months ago
- Sequence Planner☆12Nov 17, 2017Updated 8 years ago
- An Arduino library for interfacing to Nixdorf BA63 VFD customer displays☆10Updated this week
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- ☆14Jan 29, 2023Updated 3 years ago
- ☆10Apr 13, 2023Updated 2 years ago
- Luxonis ML library which abstracts logging, tracking, and other useful functionalities.☆17Updated this week
- StumbleBar by StumbleUpon☆13Jan 25, 2018Updated 8 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- This Repository contains the Demo Script, Code for all the sessions which I will be doing in Year 2025☆12Jun 28, 2025Updated 7 months ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆17Nov 28, 2025Updated 2 months ago
- A decentralized liquidity platform on Pi Network.☆21May 25, 2025Updated 8 months ago
- Advanced practice with MCP☆24Nov 16, 2025Updated 3 months ago
- super small gpt implementation☆16Dec 15, 2024Updated last year
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool☆16Mar 5, 2025Updated 11 months ago
- A simple Swagger plugin for moleculer-web.☆15Jan 4, 2023Updated 3 years ago
- A few examples for LMAX disruptor☆17Aug 1, 2011Updated 14 years ago
- A simple save file editor for the game "Generation Zero"☆14Apr 8, 2021Updated 4 years ago
- A curated list of startup incubators around the globe☆12Apr 5, 2017Updated 8 years ago
- Explorations on being together in the browser☆111Mar 28, 2019Updated 6 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- benchmarks for evaluating MT models☆11Jun 26, 2024Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated last year