☆24Oct 13, 2024Updated last year
Alternatives and similar repositories for LLM-Game-Agent
Users that are interested in LLM-Game-Agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)☆14Aug 12, 2024Updated last year
- ☆26May 20, 2024Updated last year
- RecAlpaca: A simple framework combing Alpaca and Recommendations.☆34Mar 30, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- LVAS-Agent Code Base☆22Apr 15, 2025Updated 11 months ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- ☆53Mar 3, 2026Updated 3 weeks ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimality☆20Oct 22, 2025Updated 5 months ago
- ECCV2020_Spatial Hierarchy Aware Residual Pyramid Network for Time-of-Flight Depth Denoising☆12Sep 24, 2020Updated 5 years ago
- ☆10Sep 26, 2023Updated 2 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆51Oct 16, 2025Updated 5 months ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆26Feb 18, 2026Updated last month
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆48Sep 26, 2024Updated last year
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆15Nov 20, 2025Updated 4 months ago
- 《金融中的人工智能》配套代码☆11Sep 20, 2022Updated 3 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Aug 11, 2022Updated 3 years ago
- A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)☆10May 8, 2025Updated 10 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively☆58Feb 22, 2026Updated last month
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Hands-On Tutorial on Building Multimodal RAG Systems☆13Apr 10, 2025Updated 11 months ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- A compiler of Decaf(an object-oriented compiler)☆12Sep 26, 2017Updated 8 years ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆31Jan 29, 2026Updated last month
- Official repository of "Zero-Shot Character Identification and Speaker Prediction in Comics via Iterative Multimodal Fusion" (ACMMM 2024)☆15Oct 31, 2024Updated last year
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆18Jun 6, 2024Updated last year
- Introduce a novel Video Trimming (VT) task and proposes an agent-based approach (AVT) for detecting wasted footage, selecting valuable se…☆23Jan 20, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆55Mar 17, 2026Updated last week
- ☆14Jan 12, 2022Updated 4 years ago
- 自动每天给女友发邮件☆12Jun 8, 2021Updated 4 years ago
- [ICLR 2024] Towards Elminating Hard Label Constraints in Gradient Inverision Attacks☆14Feb 6, 2024Updated 2 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- Implementation of a neural network MLP in C++.☆10Dec 17, 2018Updated 7 years ago
- ☆12Mar 27, 2024Updated last year
- Rummikub solver coded in Python that uses integer linear programming to maximise the number or value of tiles placed in the popular board…☆15Mar 14, 2021Updated 5 years ago