The official source code for "Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling" (ACL 2024, Findings)
☆15Aug 12, 2024Updated last year
Alternatives and similar repositories for recon
Users who are interested in recon are comparing it to the libraries listed below.
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆41Oct 30, 2023Updated 2 years ago
- ☆24Oct 13, 2024Updated last year
- ☆14Dec 16, 2023Updated 2 years ago
- ☆28Oct 9, 2024Updated last year
- ☆16Apr 12, 2024Updated 2 years ago
- Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF☆25Oct 8, 2024Updated last year
- Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL☆24Oct 30, 2023Updated 2 years ago
- ☆32Feb 23, 2025Updated last year
- This repository contains an LLM benchmark for the social deduction game "Resistance Avalon"☆153May 30, 2025Updated last year
- ☆18Oct 9, 2024Updated last year
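- An implementation of a vision-language multimodal model based on the lightweight Qwen2.5-0.5B and SigLIP, including training and SFT code. Shares the training and SFT code and documents the process of exploring and learning. Discussion is welcome.☆20Aug 31, 2025Updated 9 months ago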
- ☆34Oct 31, 2024Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Jan 28, 2024Updated 2 years ago
- Official implementation of Dynamic Perceiver☆44Nov 16, 2023Updated 2 years ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality☆21Oct 22, 2025Updated 7 months ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments.☆30May 9, 2026Updated last month
- Official repository of Uni-AdaFocus (TPAMI 2024).☆60Dec 17, 2024Updated last year
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- ☆34Mar 21, 2026Updated 2 months ago
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆22May 15, 2025Updated last year
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆33Jan 29, 2026Updated 4 months ago
- Understanding deep networks and large models.☆29Jan 23, 2026Updated 4 months ago
- [ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks☆14Feb 6, 2024Updated 2 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 10 months ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- ☆28Apr 28, 2024Updated 2 years ago
- Clarity is a financial analysis agent framework built on native Claude-skill architecture. Adopting a Planning-with-Files approach, it co…☆58Jan 27, 2026Updated 4 months ago
- Official Repo of LangSuitE☆84Aug 15, 2024Updated last year
- ☆27May 30, 2026Updated 2 weeks ago
- ☆13Jul 14, 2024Updated last year
- Official Implementation of Learning Gradient Fields for Object Rearrangement☆33May 10, 2023Updated 3 years ago
- [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions☆13May 7, 2024Updated 2 years ago
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆25Nov 17, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 3 months ago
- The purpose of this repository is to discuss audio transformers☆14Apr 16, 2026Updated 2 months ago
- Open-source, knowledge-grounded conversational assistant☆14Jun 30, 2025Updated 11 months ago
- 📝🤖 WriteAI - Simplify your writing process with AI. Generate emails 📧, articles 📝, essays 📚, & more with ease. Writing is made easy …☆12Feb 21, 2023Updated 3 years ago
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- A ChatGPT clone created with NextJs, TailwindCSS, Typescript, Firebase for Google-Authentication & Realtime Database, Vercel SWR for Data…☆10Sep 18, 2023Updated 2 years ago