OSU-NLP-Group/Middleware

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OSU-NLP-Group/Middleware)

OSU-NLP-Group / Middleware

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)

☆37

Alternatives and similar repositories for Middleware

Users that are interested in Middleware are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OSU-NLP-Group / llm-planning-eval
View on GitHub
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Feb 23, 2024Updated 2 years ago
JHU-CLSP / turking-bench
View on GitHub
Web-grounded natural language instructions
☆18Nov 25, 2024Updated last year
OSU-NLP-Group / LLM-Knowledge-Conflict
View on GitHub
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆84Apr 12, 2024Updated 2 years ago
sail-sg / symbolic-instruction-tuning
View on GitHub
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Apr 18, 2023Updated 3 years ago
Timothyxxx / WorldModelPapers
View on GitHub
Paper collections of the continuous effort start from World Models.
☆216Jul 6, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆26Jul 28, 2025Updated 11 months ago
3B-Group / ConvRe
View on GitHub
🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆24Oct 10, 2023Updated 2 years ago
OSU-NLP-Group / WebDreamer
View on GitHub
[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
☆104Oct 5, 2025Updated 9 months ago
Timothyxxx / EnvInteractiveLMPapers
View on GitHub
Paper collections of methods that using language to interact with environment, including interact with real world, simulated world or WWW…
☆128Jul 26, 2023Updated 2 years ago
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated 2 years ago
zorazrw / trove
View on GitHub
[ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks
☆33Sep 20, 2024Updated last year
SivilTaram / code-html-to-markdown
View on GitHub
A lightweight script for processing HTML page to markdown format with support for code blocks
☆81Apr 14, 2024Updated 2 years ago
OSU-NLP-Group / SeeAct
View on GitHub
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…
☆851Feb 3, 2025Updated last year
google-research / arcade-nl2code
View on GitHub
☆55Aug 25, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OSU-NLP-Group / Mind2Web-2
View on GitHub
[NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
☆111May 17, 2026Updated 2 months ago
dki-lab / ArcaneQA
View on GitHub
☆23Aug 14, 2023Updated 2 years ago
neulab / MultiUI
View on GitHub
Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding
☆54Dec 12, 2024Updated last year
HKUNLP / SymGen
View on GitHub
[EMNLP'23] Code for Generating Data for Symbolic Language with Large Language Models
☆18Oct 21, 2023Updated 2 years ago
OSU-NLP-Group / RedTeamCUA
View on GitHub
[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
☆57Feb 9, 2026Updated 5 months ago
GeoEval / GeoEval
View on GitHub
This is the Repository for Geometry Problem Solving Method Evaluation
☆27Oct 8, 2024Updated last year
TIGER-AI-Lab / KB-BINDER
View on GitHub
"Few-shot In-context Learning for Knowledge Base Question Answering" [ACL2023]
☆66Jan 27, 2025Updated last year
dsridhar91 / hstm
View on GitHub
Code and data for "Heterogeneous Supervised Topic Models"
☆10Jun 27, 2022Updated 4 years ago
chanhee-luke / M-Track
View on GitHub
Code for CVPR22 paper One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones
☆13Jul 27, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HKUNLP / subgoal-theorem-prover
View on GitHub
Code for the paper "Decomposing the Enigma: Subgoal-based Demonstration Learning for Formal Theorem Proving"
☆20May 25, 2023Updated 3 years ago
OSU-NLP-Group / UGround
View on GitHub
[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents
☆315Mar 11, 2026Updated 4 months ago
OSU-NLP-Group / SkillWeaver
View on GitHub
SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.
☆144Apr 14, 2025Updated last year
OSU-NLP-Group / AgentSafety
View on GitHub
☆192Oct 31, 2025Updated 8 months ago
facebookresearch / UniK-QA
View on GitHub
Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering
☆51Aug 2, 2022Updated 3 years ago
drogozhang / LED
View on GitHub
Source code of paper 'LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval' (WWW 2023)
☆22Aug 28, 2023Updated 2 years ago
OSU-NLP-Group / SeeActChromeExtension
View on GitHub
☆18Jan 3, 2025Updated last year
microsoft / simulated-trial-and-error
View on GitHub
☆124Jun 6, 2024Updated 2 years ago
xlang-ai / xlang-paper-reading
View on GitHub
Paper collection on building and evaluating language model agents via executable language grounding
☆364Apr 29, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
njucckevin / MM-Self-Improve
View on GitHub
A Self-Training Framework for Vision-Language Reasoning
☆90Jan 23, 2025Updated last year
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
JHU-CLSP / rockfish-tutorial
View on GitHub
☆10Mar 5, 2023Updated 3 years ago
himkt / allennlp-NER
View on GitHub
☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)
☆15Nov 26, 2020Updated 5 years ago
teffland / ner-expected-entity-ratio
View on GitHub
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14Nov 7, 2022Updated 3 years ago
Timothyxxx / LMsMBTI
View on GitHub
A MBTI test on Large Language Model like GPT-3.
☆28May 2, 2022Updated 4 years ago
shadowkiller33 / Language_attack
View on GitHub
A repo for LLM jailbreak
☆14Sep 5, 2023Updated 2 years ago