JHU-CLSP/turking-bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JHU-CLSP/turking-bench)

JHU-CLSP / turking-bench

Web-grounded natural language instructions

☆18

Alternatives and similar repositories for turking-bench

Users that are interested in turking-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shulin16 / MMInA
View on GitHub
[ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents
☆54Feb 27, 2025Updated last year
RUCBM / GUICourse
View on GitHub
GUICourse: From General Vision Langauge Models to Versatile GUI Agents
☆143Mar 1, 2026Updated 4 months ago
tsafavi / cascader
View on GitHub
CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)
☆13Jun 17, 2022Updated 4 years ago
himkt / allennlp-NER
View on GitHub
☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)
☆15Nov 26, 2020Updated 5 years ago
teffland / ner-expected-entity-ratio
View on GitHub
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14Nov 7, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / simulated-trial-and-error
View on GitHub
☆124Jun 6, 2024Updated 2 years ago
OSU-NLP-Group / llm-planning-eval
View on GitHub
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54Feb 23, 2024Updated 2 years ago
aviaefrat / lmentry
View on GitHub
☆15Nov 22, 2023Updated 2 years ago
OSU-NLP-Group / EIA_against_webagent
View on GitHub
☆40Oct 2, 2024Updated last year
m3hrdadfi / albert-persian-lab
View on GitHub
ALBERT Persian Playground
☆13Jun 12, 2023Updated 3 years ago
peterwestuw / GPT2ForwardBackward
View on GitHub
Code for running forward and backward versions of GPT2
☆10Nov 20, 2021Updated 4 years ago
tihu-nlp / normalized_bijankhan
View on GitHub
Normalized and modified version of Bijankhan corpus
☆13Feb 21, 2023Updated 3 years ago
albfan / mvnexec
View on GitHub
bash script to find and execute java classes with main methods
☆20Oct 24, 2025Updated 8 months ago
XianyiCheng / HiDex
View on GitHub
☆13Jun 30, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jeantil / blog-samples
View on GitHub
code pour les billets "Refactorer Future[Option[T]]" sur
☆12Jun 14, 2017Updated 9 years ago
lxasqjc / MCPL
View on GitHub
MCPL: MULTI-CONCEPT PROMPT LEARNING
☆20May 27, 2024Updated 2 years ago
zorazrw / odex
View on GitHub
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆49Dec 22, 2023Updated 2 years ago
ruyimarone / data-portraits
View on GitHub
Documenting large text datasets 🖼️ 📚
☆14Dec 17, 2024Updated last year
tongshuangwu / llm-crowdsourcing-pipeline
View on GitHub
☆11Jul 6, 2023Updated 3 years ago
HazyResearch / wonderbread
View on GitHub
WONDERBREAD benchmark + dataset for BPM tasks
☆35Jul 30, 2025Updated 11 months ago
RAIVNLab / mnms
View on GitHub
m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks
☆46Sep 26, 2024Updated last year
Felixgithub2017 / CG-Eval
View on GitHub
Chinese Generation Evaluation
☆13Aug 14, 2023Updated 2 years ago
xnancy / russ
View on GitHub
☆16Apr 9, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
SemEval / SemEval2021
View on GitHub
☆29May 17, 2022Updated 4 years ago
crujzo / Para-Phrase
View on GitHub
Please visit this repo for enhanced and updated open source code
☆14Dec 14, 2025Updated 7 months ago
vishakhpk / mi-unsup-summ
View on GitHub
Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information
☆25Sep 11, 2021Updated 4 years ago
zharry29 / causal_reasoning_of_entities_and_events
View on GitHub
Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.
☆11May 26, 2023Updated 3 years ago
shadowkiller33 / Contrast-Instruction
View on GitHub
☆19Oct 2, 2023Updated 2 years ago
cdcrabtree / nomine
View on GitHub
Classify names by gender, U.S. ethnicity, or leaf nationality
☆19Oct 13, 2018Updated 7 years ago
koalazf99 / nanoverl
View on GitHub
Collections of RLxLM experiments using minimal codes
☆14Feb 17, 2025Updated last year
xlang-ai / Spider2-V
View on GitHub
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
☆153Aug 26, 2024Updated last year
frankxu2004 / knnlm-why
View on GitHub
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Jan 12, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
scaleapi / browser-art
View on GitHub
☆37Mar 6, 2025Updated last year
barneygovan / lsh-scala
View on GitHub
A Locality-Sensitive Hashing Library for Scala with optional Redis storage.
☆17Jan 5, 2022Updated 4 years ago
Muennighoff / FLAN
View on GitHub
Provides a minimal implementation to extract FLAN datasets for further processing
☆11Feb 1, 2023Updated 3 years ago
nafg / gitlab-ci-runner-scala
View on GitHub
A runner for GitLab CI written in Scala (based on https://github.com/virtualmarc/gitlab-ci-runner-win)
☆19Jul 23, 2014Updated 11 years ago
wolfe-pack / moro
View on GitHub
Interactive documentation and programming with Scala, iPython notebook style.
☆19Mar 9, 2016Updated 10 years ago
christianscheible / qsample
View on GitHub
A natural language processing tool for automatically detecting quotations in text.
☆15Feb 26, 2022Updated 4 years ago
google-research / arcade-nl2code
View on GitHub
☆54Aug 25, 2023Updated 2 years ago