Shenzhi-Wang/recon

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shenzhi-Wang/recon)

Shenzhi-Wang / recon

The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)

☆15

Alternatives and similar repositories for recon

Users that are interested in recon are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LeapLabTHU / FamO2O
View on GitHub
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆41Oct 30, 2023Updated 2 years ago
3DAgentWorld / LLM-Game-Agent
View on GitHub
☆24Oct 13, 2024Updated last year
Andrewzh112 / ExpeL
View on GitHub
☆14Dec 16, 2023Updated 2 years ago
LR32768 / DL_theory_exp
View on GitHub
☆16Apr 12, 2024Updated 2 years ago
ZhaolinGao / REFUEL
View on GitHub
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
☆25Oct 8, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
daswer123 / Voyager_checkpoint
View on GitHub
Checkpoint for Voyager, 160 iterations.
☆23May 27, 2023Updated 3 years ago
yueyang130 / SEEM
View on GitHub
Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
☆24Oct 30, 2023Updated 2 years ago
lucywang720 / model-surgery
View on GitHub
☆32Feb 23, 2025Updated last year
jonathanmli / Avalon-LLM
View on GitHub
This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
☆159May 30, 2025Updated last year
Meihan-Liu / 24AAAI-A2GNN
View on GitHub
Rethinking Propagation for Unsupervised Graph Domain Adaptation (AAAI-24)
☆19Jul 18, 2024Updated 2 years ago
bigai-nlco / VideoTGB
View on GitHub
[EMNLP 2024] A Video Chat Agent with Temporal Prior
☆33Mar 2, 2025Updated last year
SHI-Labs / IMG-Multimodal-Diffusion-Alignment
View on GitHub
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025
☆30Oct 1, 2025Updated 9 months ago
LeapLabTHU / L2W-DEN
View on GitHub
[ECCV 2022] Learning to Weight Samples for Dynamic Early-exiting Networks
☆38Sep 28, 2023Updated 2 years ago
cmu-mind / RISE
View on GitHub
☆34Oct 31, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jiangjiechen / auction-arena
View on GitHub
Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…
☆49Jan 28, 2024Updated 2 years ago
LeapLabTHU / Dynamic_Perceiver
View on GitHub
Official implementation of Dynamic Perceiver
☆44Nov 16, 2023Updated 2 years ago
Hambaobao / Marathon
View on GitHub
Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.
☆10May 16, 2024Updated 2 years ago
ljcleo / agent_sense
View on GitHub
Benchmarking Social Intelligence of Language Agents through Interactive Scenarios
☆13Jan 4, 2025Updated last year
all-the-noises / eval-arena
View on GitHub
☆34Mar 21, 2026Updated 4 months ago
brendanhogan / completion_tree_view
View on GitHub
☆15Apr 26, 2025Updated last year
LeapLabTHU / OVM3D-Det
View on GitHub
☆55Jan 2, 2025Updated last year
wenge-research / CRE-SFT
View on GitHub
A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)
☆11May 8, 2025Updated last year
Trae1ounG / Pretrain_Space_RLVR
View on GitHub
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17Apr 16, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bigai-nlco / langsuite
View on GitHub
Official Repo of LangSuitE
☆85Aug 15, 2024Updated last year
NathanHerr / LLM-First-Search
View on GitHub
☆17Jun 9, 2025Updated last year
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago
facebookresearch / NeuralMemory
View on GitHub
A Data Source for Reasoning Embodied Agents
☆20Sep 18, 2023Updated 2 years ago
X-LANCE / text2sql-multiturn-GPT
View on GitHub
[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
☆13May 7, 2024Updated 2 years ago
Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR
View on GitHub
The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn…
☆61Jan 5, 2026Updated 6 months ago
AaronAnima / TarGF
View on GitHub
Official Implementation of Learning Gradient Fields for Object Rearrangement
☆33May 10, 2023Updated 3 years ago
Timscore25 / WriteAI
View on GitHub
📝🤖 WriteAI - Simplify your writing process with AI. Generate emails 📧, articles 📝, essays 📚, & more with ease. Writing is made easy …
☆12Feb 21, 2023Updated 3 years ago
mrmaheshrajput / productionizing-llms
View on GitHub
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆12Mar 27, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ShuaiGuo16 / LLM-guided-AutoML
View on GitHub
LLM-guided hyperparameter tuning
☆10Oct 7, 2023Updated 2 years ago
fmelihh / circuit-breaker-pattern-fastapi
View on GitHub
Python FastApi "Circuit Breaker" implementation
☆13Mar 14, 2025Updated last year
ArslanKAS / Large-Language-Models-with-Semantic-Search
View on GitHub
Explore from keyword search to dense retrieval and reranking, which injects the intelligence of LLMs into your search system, making it f…
☆14Aug 27, 2023Updated 2 years ago
NebulAICompany / SPD-RAG
View on GitHub
A hierarchical multi-agent framework for exhaustive cross-document question answering.
☆22Mar 14, 2026Updated 4 months ago
maitrix-org / dynamic-alignment-optimization
View on GitHub
[EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…
☆24Nov 17, 2024Updated last year
LLaMafia / SFT_function_learning
View on GitHub
Explore what LLMs are really leanring over SFT
☆28Mar 30, 2024Updated 2 years ago
initdebugs / YoloV8-User-Interface
View on GitHub
This is a simple user interface for YOLOv8, a popular object detection system. The program allows the user to select a video or image fil…
☆11Apr 4, 2023Updated 3 years ago