hrwise-nlp/AppBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hrwise-nlp/AppBench)

hrwise-nlp / AppBench

This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction

☆16

Alternatives and similar repositories for AppBench

Users that are interested in AppBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hrwise-nlp / ToolsMeetLLMs
View on GitHub
☆33May 8, 2025Updated last year
LingweiMeng / QualifyingExamPreparing
View on GitHub
Qualifying Exam Preparing
☆18May 7, 2025Updated last year
ulab-uiuc / ToMAP
View on GitHub
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆25Sep 25, 2025Updated 9 months ago
qiancheng0 / Open-SMARTAgent
View on GitHub
The official repo for the code and data of paper SMART
☆42Feb 20, 2025Updated last year
YiCheng98 / Cooper
View on GitHub
This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…
☆28Mar 1, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
princeton-nlp / continual-factoid-memorization
View on GitHub
Continual Memorization of Factoids in Large Language Models
☆12Nov 20, 2024Updated last year
jmnian / WRAG
View on GitHub
Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"
☆16Oct 2, 2025Updated 9 months ago
chengyou-jia / AgentStore
View on GitHub
[ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant
☆46Dec 19, 2024Updated last year
snap-research / LargeGT
View on GitHub
Graph Transformers for Large Graphs
☆22Apr 26, 2024Updated 2 years ago
zhulishe / Quantitative-investment
View on GitHub
Use strategy in stock transaction for high revenue.
☆10Dec 24, 2015Updated 10 years ago
trestad / mitigating-reversal-curse
View on GitHub
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14Aug 2, 2024Updated last year
GasolSun36 / SURf
View on GitHub
[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆11Oct 11, 2024Updated last year
PlusLabNLP / VISCO
View on GitHub
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆13Jun 7, 2025Updated last year
mjy1111 / BAKE
View on GitHub
This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing
☆11May 25, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
rookie-joe / AutoPSV
View on GitHub
☆50Oct 28, 2024Updated last year
ulab-uiuc / AgentProtocols
View on GitHub
Opensource code for ICML 2026 poster
☆15Nov 26, 2025Updated 7 months ago
micklpl / stanford-crypto
View on GitHub
Solutions for all programming assignments from Stanford's University Online Cryptography Course (C#)
☆12Apr 12, 2015Updated 11 years ago
tmlr-group / ECON
View on GitHub
[ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium"
☆39Nov 23, 2025Updated 7 months ago
SCIR-HI / LLE-INC
View on GitHub
Repository for AAAI 2024 paper "Manifold-based Verbalizer Space Re-embedding for Tuning-free Prompt-based Classification"
☆10Feb 6, 2024Updated 2 years ago
abekdwight / wipeyy
View on GitHub
Chrome Extension to watch video using Picture-in-Picture(Always on top Floating Mini Player). It also, some shortcuts suitable for playba…
☆14Mar 8, 2025Updated last year
QwenLM / ConsisEval
View on GitHub
☆14Jul 5, 2024Updated 2 years ago
zjunlp / SemEval2021Task4
View on GitHub
The 4th rank system of the SemEval 2021 Task4.
☆10May 7, 2022Updated 4 years ago
facebookresearch / ToolVerifier
View on GitHub
This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.
☆23Mar 11, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
kodenii / Ref-Diff
View on GitHub
Ref-Diff: Zero-shot Referring Image Segmentation with Generative Models
☆21May 29, 2025Updated last year
yhao-wang / LLM-Knowledge-Boundary
View on GitHub
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆21Jul 31, 2023Updated 2 years ago
matthewrenze / jhu-concise-cot
View on GitHub
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆25Nov 25, 2024Updated last year
thomasperrot / aes-square-attack
View on GitHub
Homemade implementation of Square Attack against 4 rounds AES
☆14Feb 2, 2020Updated 6 years ago
dqxiu / KAssess
View on GitHub
☆14Oct 28, 2023Updated 2 years ago
wbopan / Awesome-EToDs-Survey
View on GitHub
Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs
☆14Nov 18, 2023Updated 2 years ago
Saibo-creator / transformers-CFG
View on GitHub
☆10Mar 1, 2025Updated last year
YuxueYang1204 / LearningNotes
View on GitHub
☆12Nov 7, 2022Updated 3 years ago
liunian-Jay / AgenticRAG-RL
View on GitHub
A minimal implementation of Agentic RAG using GRPO
☆17Jun 11, 2025Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
AI45Lab / Fake-Alignment
View on GitHub
☆17Mar 22, 2024Updated 2 years ago
jiahao42 / Simplified-Zhihu-Daily
View on GitHub
Android app for Zhihu Daily
☆15May 28, 2017Updated 9 years ago
DevoAllen / Awesome-Reasoning-Economy-Papers
View on GitHub
Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models
☆124Oct 16, 2025Updated 9 months ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
YiCheng98 / IntegrativeDecoding
View on GitHub
Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"
☆33Apr 12, 2025Updated last year
wangjq4214 / buaa-thesis
View on GitHub
The Typst template for BUAA thesis.
☆15Jul 8, 2026Updated 2 weeks ago
sail-sg / AnytimeReasoner
View on GitHub
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54Jul 15, 2025Updated last year