The evaluation code for Multi-IF: multi-turn and multilingual instruction following
☆63Oct 29, 2024Updated last year
Alternatives and similar repositories for Multi-IF
Users that are interested in Multi-IF are comparing it to the libraries listed below.
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆53Aug 26, 2024Updated last year
- ☆27Jun 2, 2026Updated last week
- Code for AAAI 2023 research track paper "Question Decomposition Tree for Answering Complex Questions over Knowledge Bases"☆17Jan 3, 2024Updated 2 years ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆118Jun 12, 2025Updated 11 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings☆74Feb 3, 2025Updated last year
- MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency☆32Sep 11, 2023Updated 2 years ago
- Evaluate the Quality of Critique☆37Jun 1, 2024Updated 2 years ago
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆27Nov 13, 2023Updated 2 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆17Feb 25, 2025Updated last year
- ☆14Aug 15, 2024Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆194Apr 29, 2026Updated last month
- This repo explores how AMR can address tasks that are difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- CDNSim is a stream-level simulator written in Python, designed to simulate large content delivery networks.☆24Mar 8, 2019Updated 7 years ago
- The official implementation of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models"☆17Mar 24, 2025Updated last year
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- Multi-Agent Reinforcement Learning☆11Jun 16, 2020Updated 5 years ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- ☆89Dec 29, 2023Updated 2 years ago
- Metaskill: A Meta-Skill for Autonomous AI Agent Team Generation☆50Feb 23, 2026Updated 3 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆415Jun 25, 2025Updated 11 months ago
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks☆61Aug 2, 2023Updated 2 years ago
- High accuracy captcha solver for SJTU Jaccount login page using SVM and ResNet.☆14Nov 9, 2022Updated 3 years ago
- A Go-based HTTP relay tool that relays your server's requests to the OpenAI API; it can also be used to set up a mirror site and works out of the box.☆11Apr 19, 2023Updated 3 years ago
- The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)☆24Feb 4, 2026Updated 4 months ago
- All data published in the Tsinghua (THU) tree hole before it was banned, covering Spring 2020 to Winter 2021☆16Jul 30, 2022Updated 3 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 3 years ago
- Papers related to wireless large AI models and wireless foundation models.☆26May 16, 2025Updated last year
- [NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs☆49Nov 29, 2024Updated last year
- Code and Data for EMNLP 2023 Paper "MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Langu…☆14Apr 7, 2025Updated last year
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆97Aug 15, 2023Updated 2 years ago
- ☆10Jan 28, 2024Updated 2 years ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆49Aug 13, 2025Updated 9 months ago
- ☆59Aug 22, 2024Updated last year
- ☆11Mar 3, 2026Updated 3 months ago