[NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
☆37Dec 1, 2025Updated 7 months ago
Alternatives and similar repositories for AgentIF
Users that are interested in AgentIF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆19May 23, 2025Updated last year
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆15May 27, 2025Updated last year
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆56Nov 4, 2024Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆56Mar 30, 2026Updated 3 months ago
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 11 months ago
- Based on EEG signal, the differential entropy feature is extracted, and the convolution neural network based on time domain network and a…☆13Sep 21, 2022Updated 3 years ago
- ☆11Nov 23, 2024Updated last year
- ☆43Jun 26, 2024Updated 2 years ago
- 北京工业大学 嵌入式系统的4个实践项目以及综合项目☆11Apr 26, 2023Updated 3 years ago
- [CIKM 2025] LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking☆18Sep 6, 2025Updated 9 months ago
- ☆62Oct 29, 2024Updated last year
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆27Apr 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python Implementation of Hierarchical Mesh Decomposition using Fuzzy Clustering and Cuts☆16Dec 17, 2022Updated 3 years ago
- Multi-Agent Reinforcement Learning☆11Jun 16, 2020Updated 6 years ago
- ☆60Dec 9, 2022Updated 3 years ago
- [ACL 2025] The official repository for "HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Res…☆22Feb 27, 2025Updated last year
- Knowledge Oriented Programming Language☆85Aug 12, 2022Updated 3 years ago
- Code for the paper: Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness☆12Oct 22, 2023Updated 2 years ago
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 4 years ago
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated last year
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆12Sep 21, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A VMD script to calculate side-chain and backbone dihedrals (torsion angles)☆14Jun 4, 2016Updated 10 years ago
- LLM plugin for interacting with llama-server models☆31May 28, 2025Updated last year
- Genarris is a random molecular crystal structure generator.☆33May 26, 2026Updated last month
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆121Mar 14, 2026Updated 3 months ago
- Source code for EMNLP 2023 paper "Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions".☆23Mar 21, 2024Updated 2 years ago
- ☆13Nov 7, 2023Updated 2 years ago
- Repositório para materiais e documentos relevantes usados na XI EMMSB.☆20May 7, 2026Updated last month
- Completing the Puzzle of All-in-One Event Understanding Benchmark with Event Arguments☆14Mar 12, 2024Updated 2 years ago
- Generate customized voxel representations of protein-ligand complexes using GPU.☆11Sep 8, 2025Updated 9 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆17Dec 14, 2023Updated 2 years ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 7 months ago
- Process BindingDB☆10Nov 19, 2015Updated 10 years ago
- A PyMOL plugin with accompanying Docker image for kinase inhibitor binding and affinity prediction☆12Jun 3, 2024Updated 2 years ago
- ☆30Apr 17, 2025Updated last year
- The document for pdfdeal☆28Feb 20, 2025Updated last year
- A program repair tool which modifies any bugged Python script based on cues from rest of program.☆20Jun 14, 2021Updated 5 years ago