APIBench is a benchmark for evaluating the performance of API recommendation approaches, released with the paper "Revisiting, Benchmarking and Exploring API Recommendation: How Far Are We?".
☆66Apr 3, 2023Updated 2 years ago
Alternatives and similar repositories for APIBench
Users who are interested in APIBench are comparing it to the repositories listed below.
- ☆12Oct 29, 2022Updated 3 years ago
- ☆14Mar 13, 2021Updated 4 years ago
- ☆17Dec 9, 2022Updated 3 years ago
- Code and data for the paper: Apathetic or Empathetic? Evaluating LLMs' Emotional Alignments with Humans☆119Jan 25, 2026Updated last month
- ☆18Apr 15, 2024Updated last year
- ☆19Dec 8, 2022Updated 3 years ago
- ☆20Mar 6, 2023Updated 2 years ago
- A dataset of reproducible breaking dependency updates, SANER 2024 (https://doi.org/10.1109/SANER60148.2024.00024)☆21Feb 20, 2026Updated last week
- Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments☆95Jan 26, 2026Updated last month
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆54Feb 22, 2026Updated last week
- ☆56Aug 10, 2024Updated last year
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆110Mar 21, 2024Updated last year
- Chinese Vision-Language Understanding Evaluation☆23Dec 26, 2024Updated last year
- ☆21May 5, 2020Updated 5 years ago
- This is the code repository for our ICPC 2021 paper "Improving Code Summarization with Block-wise Abstract Syntax Tree Splitting"☆24Jan 3, 2023Updated 3 years ago
- A small and fast image rescaling library with SIMD support☆22Aug 11, 2025Updated 6 months ago
- ☆37Jan 25, 2024Updated 2 years ago
- Source code and datasets for the paper "How well do Large Language Models perform in Arithmetic tasks?"☆57Apr 17, 2023Updated 2 years ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆71Jan 15, 2026Updated last month
- Dumps the call graph produced by FlowDroid's static analysis☆23Jun 22, 2017Updated 8 years ago
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆36Sep 18, 2025Updated 5 months ago
- [ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models☆119Jun 12, 2025Updated 8 months ago
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆33May 3, 2023Updated 2 years ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆286Aug 19, 2023Updated 2 years ago
- For our ICSE23 paper "KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair" by Nan Jiang, Thibaud Lutellier, Yiling…☆33Sep 28, 2023Updated 2 years ago
- A curated list of software engineering research, datasets, and tools.☆33Dec 16, 2022Updated 3 years ago
- ☆30Nov 23, 2020Updated 5 years ago
- Functional clone detection (currently maintained version)☆34Sep 30, 2022Updated 3 years ago
- Code for "Natural Language to Code Translation with Execution"☆41Nov 2, 2022Updated 3 years ago
- Chinese Large Language Model Evaluation, Phase 3☆35Dec 30, 2025Updated 2 months ago
- COMS30017 Computational Neuroscience☆11Jan 7, 2022Updated 4 years ago
- A library for building intraprocedural PDGs for Java programs☆36Sep 28, 2023Updated 2 years ago
- An IntelliJ plugin that generates unit test methods with meaningful names based on described behaviours with @should tags in methods ja…☆10Dec 14, 2025Updated 2 months ago
- VulTrigger is a tool for identifying vulnerability-triggering statements across functions and investigating the effectiveness of funct…☆42Dec 29, 2023Updated 2 years ago
- Code for generating the JuICe dataset.☆37Oct 27, 2021Updated 4 years ago
- This repo is the implementation of the paper "GraphSearchNet: Enhancing GNNs via Capturing Global Dependency for Semantic Code Search". W…☆32Dec 31, 2022Updated 3 years ago
- CFG based program similarity using Graph Neural Networks☆36Mar 21, 2023Updated 2 years ago
- Code for a web demo of Plan, Write, and Revise: a neural system for interactive open-domain story generation☆34Oct 25, 2021Updated 4 years ago
- ☆44Jun 24, 2025Updated 8 months ago