TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
☆162Oct 16, 2024Updated last year
Alternatives and similar repositories for TurtleBench
Users that are interested in TurtleBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChatGPT实现AI海龟汤,GPT出题、当玩家、当裁判☆19Dec 25, 2023Updated 2 years ago
- ☆22Jun 10, 2025Updated 10 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated 11 months ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Jan 4, 2024Updated 2 years ago
- ☆10Feb 17, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- launchpad smart contract create token, bonding curve, pool creation on solana☆11Aug 8, 2024Updated last year
- EthGlobal Autonomous Worlds Hackathon Project -- permissionless, composable, and autonomous on-chain RPG with playable characters as foun…☆11Jun 5, 2023Updated 2 years ago
- 汉字动态组字系统☆22Mar 15, 2026Updated last month
- Danmuku dataset☆12Jul 7, 2023Updated 2 years ago
- It is a simple demo of chatDB workflow in dify.☆24Dec 7, 2024Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆116Jun 13, 2025Updated 10 months ago
- ☆12Oct 8, 2024Updated last year
- study notes for IT☆11Feb 22, 2020Updated 6 years ago
- [ACL 2023] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation.☆10Dec 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A desktop app for searching YouTube captions across channels or videos, viewing transcripts synced to a video player, downloading clips a…☆17Apr 10, 2026Updated 3 weeks ago
- Telegroam: a bridge between Roam and Telegram. 🤩 When Roam Research receives one message, it automatically returns a previously random …☆20Oct 26, 2023Updated 2 years ago
- Telegram bot which scrapes posts from Facebook Pages to a Telegram channel.☆18Apr 10, 2021Updated 5 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆18Aug 10, 2022Updated 3 years ago
- [ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, co…☆38Sep 9, 2025Updated 8 months ago
- Your first AI prompt engineer☆414Jul 1, 2025Updated 10 months ago
- quickjs-go polyfill library☆17Jun 20, 2025Updated 10 months ago
- ☆25Jan 1, 2025Updated last year
- 💳 一个使用PHP创建Apple钱包凭证的库☆26Aug 22, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Filter X content using LLM API requests, configurable, based on Groq API☆131Aug 12, 2024Updated last year
- 流量监控☆14Oct 26, 2014Updated 11 years ago
- ☆17Oct 2, 2024Updated last year
- ☆13Apr 18, 2024Updated 2 years ago
- 豆瓣已看电影展示☆14Dec 29, 2024Updated last year
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company.☆77Feb 20, 2026Updated 2 months ago
- PyTorch Memory Efficient Sparse Sparse Matrix Multiplication☆12Aug 12, 2024Updated last year
- [ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Jou…☆34Jun 25, 2024Updated last year
- FGRM-AMOD: An Adaptive Multi-View Outlier Detection Algorithm based on Fuzzy Rough Set Multi-Granularity(2024,Code)☆13May 20, 2025Updated 11 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICHMS 2025] Polyp SAM 2: Advancing Zero shot Polyp Segmentation in Colorectal Cancer Detection☆13Aug 13, 2024Updated last year
- Code for the paper "Spatial-temporal attention graph reinforcement learning for Safety Constrained Unit Commitment Dispatch"☆10Sep 4, 2024Updated last year
- An open core EFI that will help you turn your ideapad gaming 3 (15ach6) into a Hackintosh!☆13Aug 10, 2024Updated last year
- ☆14Dec 9, 2021Updated 4 years ago
- Telegram bot that summarizes the content of a given URL with AI model☆17Feb 24, 2023Updated 3 years ago
- 👀 AI-Powered animation generator based on VueMotion.☆16Dec 6, 2024Updated last year
- Randomizer for Secret of Mana☆18Feb 13, 2026Updated 2 months ago