google-deepmind/game_arena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/game_arena)

google-deepmind / game_arena

☆110

Alternatives and similar repositories for game_arena

Users that are interested in game_arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jerry3027 / PolyIE
View on GitHub
☆16Jan 26, 2024Updated 2 years ago
ritaranx / BMRetriever
View on GitHub
[EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".
☆26Sep 19, 2024Updated last year
smiles724 / Awesome-LLM-RLVR
View on GitHub
Collection of latest papers and materials in the area of RLVR!
☆125Updated this week
aiopsplus / Carllm
View on GitHub
The supplementary material for the paper "Fine-tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code R…
☆16Aug 12, 2024Updated last year
night-chen / DyGen
View on GitHub
[KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".
☆11Jun 14, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
securade / sentinel
View on GitHub
Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.
☆30Apr 6, 2025Updated last year
wshi83 / MedAdapter
View on GitHub
[EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning
☆37Dec 26, 2024Updated last year
barthelemymp / TULIP-TCR
View on GitHub
☆14May 15, 2024Updated 2 years ago
tarsyang / quantevolve
View on GitHub
Evolutionary Quantitative Trading Strategy Development System. Fork of OpenEvolve
☆44May 30, 2025Updated last year
wshi83 / MedAgentGym
View on GitHub
[ICLR'26] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale
☆118Apr 12, 2026Updated 2 months ago
liushulinle / UloRL
View on GitHub
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 11 months ago
ZhaolinGao / A-PO
View on GitHub
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
☆41May 30, 2025Updated last year
DotDoug / TreatiseAI
View on GitHub
A simple GPT-3 interface to automate core legal writing tasks
☆13Mar 8, 2023Updated 3 years ago
tianshilu / QBRC-Somatic-Pipeline
View on GitHub
QBRC Somatic Mutation Calling Pipeline
☆16Feb 8, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
rdi-berkeley / agents-last-exam
View on GitHub
Agents' Last Exam
☆759Updated this week
aws-containers / hello-app-runner-nodejs
View on GitHub
Example Next.js application for App Runner with DynamoDB using Copilot CLI
☆13Jan 29, 2026Updated 5 months ago
WujiangXu / EPO
View on GitHub
The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"
☆39Oct 1, 2025Updated 9 months ago
thunlp / AutoForm
View on GitHub
Code for paper "Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication"
☆23Mar 30, 2024Updated 2 years ago
codelion / dynamic-shell-server
View on GitHub
Dynamic Shell Command MCP Server
☆41Feb 27, 2025Updated last year
bbartoldson / TBA
View on GitHub
Official implementation of TBA for async LLM post-training.
☆31Nov 5, 2025Updated 8 months ago
usememos / mui
View on GitHub
☆26Apr 18, 2026Updated 2 months ago
yuhui-zh15 / C3
View on GitHub
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆35Oct 16, 2024Updated last year
Lingkai-Kong / so-ebm
View on GitHub
Code for paper: End-to-end Stochastic Optimization with Energy-based Model
☆16Feb 14, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 8 months ago
Tener / spike
View on GitHub
Experimental web browser
☆21Mar 15, 2012Updated 14 years ago
sourceclear / ransomware-poc
View on GitHub
A poc to demonstrate how ransomware can spread to enterprise apps through libraries
☆11Mar 28, 2018Updated 8 years ago
0x404 / conventional-commit-classification
View on GitHub
A First Look at Conventional Commits Classification
☆15Nov 18, 2024Updated last year
NJU-LINK / CodeTracer
View on GitHub
☆79Jun 19, 2026Updated 2 weeks ago
huggingface / transformers_bloom_parallel
View on GitHub
Techniques used to run BLOOM at inference in parallel
☆37Oct 21, 2022Updated 3 years ago
guardagent / code
View on GitHub
☆47Dec 9, 2025Updated 6 months ago
s-smits / automatic-learning-amplifier
View on GitHub
MLX-based QA pair generator and LLM finetuning tool in Streamlit
☆42Oct 18, 2025Updated 8 months ago
DarkStarStrix / DataVolt
View on GitHub
Reusable data engineering toolkit My personal data infrastructure
☆19Oct 29, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
xlang-ai / computer-agent-arena
View on GitHub
[ICLR 2026] Computer Agent Arena: Toward Human-Centric Evaluation and Analysis of Computer-Use Agents
☆65Feb 26, 2026Updated 4 months ago
mayank31398 / ladder-residual-inference
View on GitHub
☆14Jul 13, 2025Updated 11 months ago
DeqingFu / transformers-icl-second-order
View on GitHub
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…
☆20Nov 19, 2024Updated last year
Jumitti / chicken_AI
View on GitHub
Just a Streamlit app to generate scientific review with Deep Learning, Neuronal Network and AI
☆14Sep 18, 2025Updated 9 months ago
peterljq / Tutorial-of-Data-Distillation-and-Condensation
View on GitHub
A comprehensive overview of Data Distillation and Condensation (DDC). DDC is a data-centric task where a representative (i.e., small but …
☆13Dec 1, 2022Updated 3 years ago
wshi83 / EhrAgent
View on GitHub
[EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records
☆137Dec 26, 2024Updated last year
legal-nlp / oab-exams
View on GitHub
data about OAB Exams
☆11Oct 1, 2018Updated 7 years ago