[NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios
☆28Dec 1, 2025Updated 3 months ago
Alternatives and similar repositories for AgentIF
Users that are interested in AgentIF are comparing it to the libraries listed below
Sorting:
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 9 months ago
- ☆41Feb 22, 2026Updated last week
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆15May 27, 2025Updated 9 months ago
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 7 months ago
- [EMNLP2024] Aligning Large Language Models on Information Extraction☆54Nov 4, 2024Updated last year
- Repositório para materiais e documentos relevantes usados na XI EMMSB.☆14Sep 3, 2024Updated last year
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried …☆14Jan 10, 2024Updated 2 years ago
- Code for the paper: Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness☆12Oct 22, 2023Updated 2 years ago
- [CIKM 2025] LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking☆17Sep 6, 2025Updated 5 months ago
- ☆11Nov 23, 2024Updated last year
- ☆15Apr 26, 2025Updated 10 months ago
- ☆15Nov 27, 2025Updated 3 months ago
- ☆12Apr 18, 2025Updated 10 months ago
- A VMD script to calculate side-chain and backbone dihedrals (torsion angles)☆14Jun 4, 2016Updated 9 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- ☆22Jan 15, 2026Updated last month
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 7 months ago
- Solidity Function Selector Miner (AVX2 + Multithreaded)☆10Nov 27, 2023Updated 2 years ago
- ASCIIFY can be used to convert images and videos to ASCII Art. I have made this project as a part of the IITR ACM- Open Summer Project, 2…☆11Jun 19, 2025Updated 8 months ago
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆48Sep 13, 2025Updated 5 months ago
- A model context protocol server for your Gmail☆17Jul 8, 2025Updated 7 months ago
- ☆11May 18, 2025Updated 9 months ago
- ☆26Feb 11, 2026Updated 3 weeks ago
- Sandbox your LLM coding agents on macOS so they can only touch the files they need☆27Feb 19, 2026Updated last week
- ☆14Mar 4, 2024Updated last year
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 8 years ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- ☆26Feb 8, 2026Updated 3 weeks ago
- Generate customized voxel representations of protein-ligand complexes using GPU.☆11Sep 8, 2025Updated 5 months ago
- aimd - AI Streaming Markdown Renderer☆14Jul 11, 2025Updated 7 months ago
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆25Jul 22, 2025Updated 7 months ago
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- Code used for articles published at Nvidia's Developer Blog☆11Jun 16, 2022Updated 3 years ago
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆21Jan 29, 2026Updated last month
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆16Updated this week
- A PyMOL plugin with accompanying Docker image for kinase inhibitor binding and affinity prediction☆12Jun 3, 2024Updated last year
- ☆62Oct 29, 2024Updated last year
- Unlimited LLM tools, zero context penalties — ToolRAG serves exactly the LLM tools your user-query demands.☆22Mar 29, 2025Updated 11 months ago