mazzzystar/TurtleBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mazzzystar/TurtleBench)

mazzzystar / TurtleBench

TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.

☆162

Alternatives and similar repositories for TurtleBench

Users that are interested in TurtleBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wzk1015 / GPT-turtlesoup
View on GitHub
ChatGPT实现AI海龟汤，GPT出题、当玩家、当裁判
☆19Dec 25, 2023Updated 2 years ago
IAAR-Shanghai / SEAP
View on GitHub
☆22Jun 10, 2025Updated 10 months ago
lilakk / BLEUBERI
View on GitHub
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆32Jun 5, 2025Updated 11 months ago
julien-c / arxiv-to-hf
View on GitHub
Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page
☆26Jan 4, 2024Updated 2 years ago
aniketp02 / wav2lip_144x144
View on GitHub
☆10Feb 17, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
TokenixFinance / tokenix-launchpad
View on GitHub
launchpad smart contract create token, bonding curve, pool creation on solana
☆11Aug 8, 2024Updated last year
BriefCandle / netherscape
View on GitHub
EthGlobal Autonomous Worlds Hackathon Project -- permissionless, composable, and autonomous on-chain RPG with playable characters as foun…
☆11Jun 5, 2023Updated 2 years ago
chilingg / fasing
View on GitHub
汉字动态组字系统
☆22Mar 15, 2026Updated last month
AIM3-RUC / VideoIC
View on GitHub
Danmuku dataset
☆12Jul 7, 2023Updated 2 years ago
hyongtao-code / chatDB-dify
View on GitHub
It is a simple demo of chatDB workflow in dify.
☆24Dec 7, 2024Updated last year
MadeAgents / Hammer
View on GitHub
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
☆116Jun 13, 2025Updated 10 months ago
mirichy419 / MI.RICHY-BUG-BOT
View on GitHub
☆12Oct 8, 2024Updated last year
longtails / notes
View on GitHub
study notes for IT
☆11Feb 22, 2020Updated 6 years ago
LiqiangJing / TEAM
View on GitHub
[ACL 2023] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.
☆10Dec 19, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
serptail / CapScript-Youtube-Subtitle-Search-Tool
View on GitHub
A desktop app for searching YouTube captions across channels or videos, viewing transcripts synced to a video player, downloading clips a…
☆17Apr 10, 2026Updated 3 weeks ago
JimmyLv / telegroam
View on GitHub
Telegroam: a bridge between Roam and Telegram. 🤩 When Roam Research receives one message, it automatically returns a previously random …
☆20Oct 26, 2023Updated 2 years ago
o20n3 / Facebook-to-Telegram-Bot
View on GitHub
Telegram bot which scrapes posts from Facebook Pages to a Telegram channel.
☆18Apr 10, 2021Updated 5 years ago
alibaba-mmai-research / HiCo
View on GitHub
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆18Aug 10, 2022Updated 3 years ago
zwhong714 / PSFT
View on GitHub
[ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, co…
☆38Sep 9, 2025Updated 8 months ago
weavel-ai / Ape
View on GitHub
Your first AI prompt engineer
☆414Jul 1, 2025Updated 10 months ago
buke / quickjs-go-polyfill
View on GitHub
quickjs-go polyfill library
☆17Jun 20, 2025Updated 10 months ago
NovelQA / novelqa.github.io
View on GitHub
☆25Jan 1, 2025Updated last year
hiouttime / php-passkit
View on GitHub
💳 一个使用PHP创建Apple钱包凭证的库
☆26Aug 22, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ricklamers / x-ai-content-filter-groq
View on GitHub
Filter X content using LLM API requests, configurable, based on Groq API
☆131Aug 12, 2024Updated last year
ONEWateR / FlowMonitor
View on GitHub
流量监控
☆14Oct 26, 2014Updated 11 years ago
Jhaprince / MultiBully
View on GitHub
☆17Oct 2, 2024Updated last year
kyriemao / ChatRetriever
View on GitHub
☆13Apr 18, 2024Updated 2 years ago
dongyubin / douban
View on GitHub
豆瓣已看电影展示
☆14Dec 29, 2024Updated last year
hcompai / hai-cookbook
View on GitHub
H.AI cookbook provides code examples and guides to help developers use models developed by H Company.
☆77Feb 20, 2026Updated 2 months ago
karShetty / Torch-Sparse-Multiply
View on GitHub
PyTorch Memory Efficient Sparse Sparse Matrix Multiplication
☆12Aug 12, 2024Updated last year
IAAR-Shanghai / NewsBench
View on GitHub
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Jou…
☆34Jun 25, 2024Updated last year
YF-W / FGRM-AMOD
View on GitHub
FGRM-AMOD: An Adaptive Multi-View Outlier Detection Algorithm based on Fuzzy Rough Set Multi-Granularity(2024,Code)
☆13May 20, 2025Updated 11 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
sajjad-sh33 / Polyp-SAM-2
View on GitHub
[ICHMS 2025] Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection
☆13Aug 13, 2024Updated last year
xunfeng1528 / GP-T
View on GitHub
Code for the paper "Spatial-temporal attention graph reinforcement learning for Safety Constrained Unit Commitment Dispatch"
☆10Sep 4, 2024Updated last year
Mihot7 / lenovo-ideapad-gaming-3-15ach6-open-core
View on GitHub
An open core EFI that will help you turn your ideapad gaming 3 (15ach6) into a Hackintosh!
☆13Aug 10, 2024Updated last year
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
galinaalperovich / ai_summary_tg_bot
View on GitHub
Telegram bot that summarizes the content of a given URL with AI model
☆17Feb 24, 2023Updated 3 years ago
Bug-Duck / LLMVision
View on GitHub
👀 AI-Powered animation generator based on VueMotion.
☆16Dec 6, 2024Updated last year
Moppu / SecretOfManaRandomizer
View on GitHub
Randomizer for Secret of Mana
☆18Feb 13, 2026Updated 2 months ago