Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena"
☆49Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for auction-arena
Users that are interested in auction-arena are comparing it to the libraries listed below.
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆13Jun 22, 2025Updated 9 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆22Oct 13, 2024Updated last year
- Code for the paper "Critical Thinking for Language Models"☆12Jun 1, 2021Updated 4 years ago
- ☆19May 17, 2025Updated 11 months ago
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆15Aug 12, 2024Updated last year
- [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Apr 30, 2024Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆159Oct 19, 2024Updated last year
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences☆33Feb 21, 2024Updated 2 years ago
- ☆98Dec 5, 2023Updated 2 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- Codebase for EnterpriseOps-Gym from ServiceNow☆81Mar 25, 2026Updated 3 weeks ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Jun 29, 2024Updated last year
- Count Tokens of Code (forked from gocloc)☆45Aug 19, 2024Updated last year
- ☆10Oct 17, 2021Updated 4 years ago
- Code for "Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies"☆37Apr 12, 2024Updated 2 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Code for "Neural Retrievers are Biased Towards LLM-Generated Content"☆14Oct 18, 2024Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- A Multi-graph Multi-head Adaptive Temporal Graph Convolutional Network☆11May 21, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- Code and data for experiments on semantic fragments☆11Jun 23, 2022Updated 3 years ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆350Dec 3, 2025Updated 4 months ago
- Jupyter notebook that generates useful graphs and statistical metrics for analyzing and improving glycemic control.☆13Feb 5, 2025Updated last year
- Multiple Transformation Function Estimation for Image Enhancement☆22Oct 20, 2024Updated last year
- Problem-Oriented Segmentation and Retrieval EMNLP 2024 Findings☆34Nov 12, 2024Updated last year
- Implementation of Scheme in Python supporting call/cc and hygienic macros☆16Sep 12, 2015Updated 10 years ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- Neural machine translation implementation using DyNet's Python bindings☆17Jan 24, 2018Updated 8 years ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆35Mar 9, 2026Updated last month
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆114Jun 16, 2023Updated 2 years ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- SHUbeamer is a LaTeX Beamer template written to help Shanghai University faculty and students prepare presentations☆10Dec 1, 2021Updated 4 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Code for EMNLP-2018 paper "Variational Autoregressive Decoder for Neural Response Generation"☆16Oct 11, 2019Updated 6 years ago
- ☆17Dec 11, 2023Updated 2 years ago
- Repo for "Centaur: Robust Multimodal Fusion for Human Activity Recognition"☆10Jan 9, 2024Updated 2 years ago