mazzzystar / TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
☆122Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for TurtleBench
- ☆50Updated 2 months ago
- ☆225Updated 8 months ago
- Multiple instructed-LLMs engage in multi-round "self-questioning" to seek the optimal solution, borrowing from the idea of debate, iterat…☆52Updated 3 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆42Updated 2 months ago
- Conversational Retrieval Evaluation Dataset☆91Updated last month
- ☆302Updated 3 weeks ago
- Scrape the webpage convert it into Markdown, and enhance AI search applications.☆232Updated 6 months ago
- The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆389Updated last month
- Build games with GPT☆308Updated 3 months ago
- A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处…☆194Updated this week
- ☆250Updated 3 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆72Updated 6 months ago
- The simplest open-source implementation of perplexity.ai☆259Updated 2 months ago
- Prompt 工程师利器,可同时比较多个 Prompts 在多个 LLM 模型上的效果☆97Updated last year
- Examples and guides for using the LLMs☆256Updated last year
- 📝 针对文档类图像做内容提取,将文档类图像一比一输出到Word或者Txt中,便于进一步使用或处理。后续计划支持输入PDF/图像,输出对应json格式、Txt格式、Word格式和Markdown格式。☆149Updated last week
- A simple and fast backend API, based on Hono, that can search for relevant content on the internet using keywords and convert it into a f…☆242Updated 6 months ago
- 这是一个基于 Next.js 构建的多语言 AI 模型评估平台,支持多模型对比和实时流式响应。A multilingual AI model evaluation platform built with Next.js, allowing users to compare …☆70Updated 3 weeks ago
- A Python Package to Access World-Class Generative Models☆124Updated 4 months ago
- WebDesignAgent : Towards Effortless Website Creation☆238Updated last month
- ☆217Updated last year
- Convert different model APIs into the OpenAI API format out of the box.☆143Updated 8 months ago
- o1-like Chain of Thoughts on claude-3-5-sonnet!☆68Updated last month
- 开发ing,将Dify接入飞书机器人☆86Updated 4 months ago
- Literally Better YouTube Summary 🎯☆199Updated 7 months ago
- HF🤗每日简报机器人☆34Updated this week
- 让 AI 设计 AI,让大模型帮助小模型进化,用魔法创造魔法! Empower Artificial Intelligence to sculpt its own kind, where colossal models gracefully usher the petit…☆94Updated last year
- ☆25Updated 2 months ago