jinhaoduan/GTBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jinhaoduan/GTBench)

jinhaoduan / GTBench

[NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

☆70

Alternatives and similar repositories for GTBench

Users that are interested in GTBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KaidiXu / LiRPA_Verify
View on GitHub
Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers"
☆17Jan 27, 2023Updated 3 years ago
sunsmarterjie / ChatterBox
View on GitHub
[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues
☆61May 2, 2025Updated last year
kong13661 / PIA
View on GitHub
Official repo for An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization
☆16Mar 8, 2024Updated 2 years ago
rosewang2008 / backtracing
View on GitHub
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆91Jul 21, 2024Updated 2 years ago
ysh-1998 / CoWPiRec
View on GitHub
The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation.
☆25Jan 30, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Babelscape / LLM-Oasis
View on GitHub
This repository contains the resource introduced in the paper: "Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis"…
☆25Oct 15, 2025Updated 9 months ago
CUHK-ARISE / GAMABench
View on GitHub
Code and data for the paper: Competing Large Language Models in Multi-Agent Gaming Environments
☆98Jan 26, 2026Updated 6 months ago
doslim / Sentiment-Analysis-SST5
View on GitHub
An LSTM model implemented by PyTorch to perform sentiment classification on the Stanford Sentiment Treebank (SST-5) dataset.
☆12Sep 13, 2022Updated 3 years ago
xlite-dev / qwen-image-fast
View on GitHub
⚡️Qwen-Image 4.8x🎉 speedup with Hybrid Acceleration for low VRAM GPUs
☆17Oct 24, 2025Updated 9 months ago
KaidiXu / Beta-CROWN
View on GitHub
β-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Verification
☆31Nov 9, 2021Updated 4 years ago
AAAI-DISIM-UnivAQ / DALI
View on GitHub
DALI Multi Agent System Framework
☆43Mar 24, 2026Updated 4 months ago
lambert-x / ProLab
View on GitHub
Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des…
☆55Aug 27, 2025Updated 10 months ago
felixcheng97 / AGAP
View on GitHub
[3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing
☆33Feb 13, 2025Updated last year
AISG-Technology-Team / AISG-Online-Safety-Challenge-Submission-Guide
View on GitHub
Submission Guide + Discussion Board for AI Singapore Online Safety Prize Challenge
☆14Mar 20, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JohannesTheo / trapped-in-texture-bias
View on GitHub
Official code release for the paper Trapped in texture bias? A large scale comparison of deep instance segmentation, accepted at ECCV 202…
☆16Jan 16, 2024Updated 2 years ago
Tony-Lowe / RotationDrag
View on GitHub
☆35Jan 23, 2024Updated 2 years ago
YoucanBaby / VTG-GPT
View on GitHub
[AAAI 2025] VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT
☆112Jan 13, 2026Updated 6 months ago
luchris429 / model-free-opponent-shaping
View on GitHub
Code for Model-Free Opponent Shaping (ICML 2022)
☆24Nov 18, 2022Updated 3 years ago
Elizabethxyhu / NeurIPS_Two_Stage_Predict-Optimize
View on GitHub
☆27Mar 25, 2026Updated 4 months ago
Senwang98 / MonoSKD
View on GitHub
[ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient
☆32Dec 8, 2023Updated 2 years ago
KaidiXu / ZO-minmax
View on GitHub
Zeroth-order Min-max Optimization
☆13Jun 28, 2020Updated 6 years ago
Fay-Y / Diffusion-RSCC
View on GitHub
☆22Mar 23, 2025Updated last year
ChengshuaiZhao0 / The-Wolf-Within
View on GitHub
☆13Jul 16, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Hannibal046 / nanoColBERT
View on GitHub
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆83Mar 18, 2024Updated 2 years ago
VNN-COMP / vnncomp2023_benchmarks
View on GitHub
Benchmarks for the VNN Comp 2023
☆16Jun 7, 2024Updated 2 years ago
clinicalml / co-llm
View on GitHub
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆128May 7, 2024Updated 2 years ago
EvanZhuang / MetaTree
View on GitHub
Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers
☆115Sep 13, 2024Updated last year
giangdip2410 / HyperRouter
View on GitHub
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Nov 29, 2023Updated 2 years ago
jinhaoduan / SecMI
View on GitHub
[ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks?
☆45Sep 4, 2024Updated last year
arturxe2 / ASTRA
View on GitHub
PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…
☆44May 20, 2024Updated 2 years ago
LYX0501 / SPRING
View on GitHub
☆13Mar 25, 2023Updated 3 years ago
tangshuang / chatglmjs
View on GitHub
ChatGLM Node.js Addon
☆11Mar 4, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Zhen-Tan-dmml / TLP-FSNC
View on GitHub
Pytorch Implementation of LoG 22 [Oral] -- Transductive Linear Probing: A Novel Framework for Few-Shot Node Classification
☆17May 31, 2023Updated 3 years ago
tcwangshiqi-columbia / GCP-CROWN
View on GitHub
The official repo for GCP-CROWN paper
☆13Sep 26, 2022Updated 3 years ago
Wenyueh / game_theory
View on GitHub
How to create rational LLM-based agents? Using game-theoretic workflows!
☆110Jun 8, 2025Updated last year
azusakou / MiranDa
View on GitHub
MiranDa: Mimicking the learning process of human doctors to achieve causal inference for medication recommendation
☆13Jan 19, 2026Updated 6 months ago
manuelladron / semantic_based_painting
View on GitHub
☆43Sep 10, 2025Updated 10 months ago
MAGIC-AI4Med / MMedLM
View on GitHub
[Nature Communications] The official codes for "Towards Building Multilingual Language Model for Medicine"
☆284May 9, 2025Updated last year
Hangwei-Chen / EAMB-Net
View on GitHub
☆13Mar 10, 2024Updated 2 years ago