Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena"
☆49Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for auction-arena
Users that are interested in auction-arena are comparing it to the libraries listed below.
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- [COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?☆22Oct 13, 2024Updated last year
- Code for the paper "Critical Thinking for Language Models"☆13Jun 1, 2021Updated 4 years ago
- [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Apr 30, 2024Updated 2 years ago
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences☆33Feb 21, 2024Updated 2 years ago
- ☆98Dec 5, 2023Updated 2 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o…☆19Oct 4, 2024Updated last year
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆69Jun 29, 2024Updated last year
- ☆10Oct 17, 2021Updated 4 years ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files☆30Oct 23, 2023Updated 2 years ago
- My personal response to OpenAI's Grant Challenge☆29Jun 13, 2023Updated 2 years ago
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- A Multi-graph Multi-head Adaptive Temporal Graph Convolutional Network☆11May 21, 2023Updated 2 years ago
- XmodelLM☆38Nov 19, 2024Updated last year
- Code and data for experiments on semantic fragments☆11Jun 23, 2022Updated 3 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- Multiple Transformation Function Estimation for Image Enhancement☆22Oct 20, 2024Updated last year
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- Implementation of Scheme in Python supporting call/cc and hygienic macros☆16Sep 12, 2015Updated 10 years ago
- HELP: a dataset for Handling Entailments with Lexical and logical Phenomena (Ver.1.0)☆15Jul 20, 2023Updated 2 years ago
- Neural machine translation implementation using dynet's python bindings☆17Jan 24, 2018Updated 8 years ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆35Mar 9, 2026Updated 2 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- ☆17Dec 11, 2023Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆27Aug 23, 2025Updated 8 months ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- ☆14Jul 17, 2024Updated last year
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆510Nov 7, 2025Updated 6 months ago
- IMGenie is a service that recommends music to match everyday images.☆20Oct 6, 2023Updated 2 years ago
- A tool library for riichi mahjong written in Rust, made mostly to be used as a WASM component.☆12Aug 29, 2025Updated 8 months ago
- This is the code repo for the paper AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play (NeurIPS 2025 Spotl…☆25Sep 29, 2025Updated 7 months ago
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago
- ☆11Feb 4, 2022Updated 4 years ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24May 24, 2022Updated 3 years ago
- ☆14May 7, 2021Updated 5 years ago