mazzzystar / TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.
☆145Updated 5 months ago
Alternatives and similar repositories for TurtleBench:
Users that are interested in TurtleBench are comparing it to the libraries listed below
- Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"☆264Updated 2 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought☆211Updated 3 months ago
- ☆50Updated 3 months ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆242Updated last month
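- Use free LLM APIs together with your private-domain data to generate SFT training data (completely free of charge); supports the training data formats of tools such as llamafactory. Synthetic data☆154Updated 4 months ago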
- Qwen GRPO Graph Extraction RL Finetune☆43Updated last month
- Conversational Retrieval Evaluation Dataset☆100Updated last month
- Multiple instructed-LLMs engage in multi-round "self-questioning" to seek the optimal solution, borrowing from the idea of debate, iterat…☆74Updated 7 months ago
- ☆220Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆43Updated 2 months ago
- ☆103Updated 3 months ago
- As the name suggests: a hand-built RAG☆121Updated last year
- GLM Series Edge Models☆131Updated last month
- ☆427Updated last month
- A streamlined, user-friendly JSON streaming preprocessor, crafted in Python.☆97Updated 6 months ago
- Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-…☆278Updated 9 months ago
- Build games with GPT☆312Updated 8 months ago
- A lightweight script for converting HTML pages to Markdown format, with support for code blocks☆79Updated 11 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆43Updated 7 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆49Updated this week
- Evaluation for AI apps and agents☆36Updated last year
- Convert different model APIs into the OpenAI API format out of the box.☆147Updated last year
- Chrome / Edge extension to turn arXiv papers into Markdown in one click.☆77Updated last week
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆180Updated last month
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆260Updated 10 months ago
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆648Updated 7 months ago
- A Python Package to Access World-Class Generative Models☆128Updated 9 months ago
- 🌐 WebWalker: Benchmarking LLMs in Web Traversal☆378Updated 2 weeks ago
- We are the first fully commercially usable role-play large language model.☆39Updated 7 months ago
- The Level-Navi Agent, a framework that requires no training and utilizes large language models for deep query understanding and precise s…☆78Updated 3 months ago