Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena"
☆49 · Jan 28, 2024 · Updated 2 years ago
Alternatives and similar repositories for auction-arena
Users that are interested in auction-arena are comparing it to the libraries listed below
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆13 · Jun 22, 2025 · Updated 8 months ago
- [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D… ☆25 · Apr 30, 2024 · Updated last year
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences ☆34 · Feb 21, 2024 · Updated 2 years ago
- [COLM'24] "How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?" ☆22 · Oct 13, 2024 · Updated last year
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆36 · Aug 19, 2023 · Updated 2 years ago
- [NeurIPS XAIA & Springer] Code and notebooks for the paper "A Fresh Look at Sanity Checks for Saliency Maps" ☆25 · Jul 12, 2024 · Updated last year
- ☆19 · May 17, 2025 · Updated 9 months ago
- ☆10 · Dec 14, 2020 · Updated 5 years ago
- Multiple Transformation Function Estimation for Image Enhancement ☆22 · Oct 20, 2024 · Updated last year
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files ☆30 · Oct 23, 2023 · Updated 2 years ago
- Problem-Oriented Segmentation and Retrieval (EMNLP 2024 Findings) ☆34 · Nov 12, 2024 · Updated last year
- Code for the paper "Critical Thinking for Language Models" ☆12 · Jun 1, 2021 · Updated 4 years ago
- Jupyter notebook that generates useful graphs and statistical metrics for analyzing and improving glycemic control. ☆13 · Feb 5, 2025 · Updated last year
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings) ☆14 · Aug 12, 2024 · Updated last year
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- DALI Multi Agent System Framework ☆42 · Jan 30, 2026 · Updated last month
- ☆17 · Dec 11, 2023 · Updated 2 years ago
- Generated emo rap lyrics in the style of xxxtentacion, JuiceWRLD and Lil Peep ☆19 · Jul 10, 2020 · Updated 5 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAEs trained by Joseph Bloom o… ☆19 · Oct 4, 2024 · Updated last year
- ☆21 · Mar 19, 2024 · Updated last year
- Real-time browser-based Voice Activity Detection (VAD) using JavaScript and the Web Audio API. A modular and easily expandable web applic… ☆22 · Nov 7, 2023 · Updated 2 years ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio… ☆45 · Dec 6, 2025 · Updated 3 months ago
- Wire Removal Video Datasets 2 (WRV2) ☆46 · Jul 14, 2025 · Updated 7 months ago
- ☆36 · Oct 10, 2024 · Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆146 · Oct 19, 2024 · Updated last year
- Initialization using Update Approximation is a Silver Bullet for Extremely Efficient Low-Rank Fine-Tuning ☆52 · Oct 17, 2025 · Updated 4 months ago
- ☆21 · Aug 30, 2025 · Updated 6 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI training ☆37 · Mar 20, 2023 · Updated 2 years ago
- (CVPR 2025) Official implementation of DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA… ☆26 · Aug 23, 2025 · Updated 6 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆92 · Jul 21, 2024 · Updated last year
- Official code for "3HAN: A Deep Neural Network for Fake News Detection" (ICONIP 2017) ☆88 · Jun 21, 2018 · Updated 7 years ago
- This repository contains a framework for converting monocular videos into side-by-side (SBS) 3D videos. It utilizes a combination of imag… ☆90 · Feb 11, 2024 · Updated 2 years ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding ☆49 · Jan 9, 2024 · Updated 2 years ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks" ☆56 · May 22, 2025 · Updated 9 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations ☆69 · Sep 6, 2024 · Updated last year
- ☆99 · Dec 5, 2023 · Updated 2 years ago
- NExT-GPT: Any-to-Any Multimodal Large Language Model ☆20 · Nov 3, 2024 · Updated last year
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation. ☆25 · Jan 30, 2024 · Updated 2 years ago
- Cell Tracking metrics and tools for the CellTrackingChallenge as described in 'CHOTA: A Higher Order Accuracy Metric for Cell Tracking' ☆29 · Jan 20, 2026 · Updated last month