[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
☆267Jun 28, 2024Updated last year
Alternatives and similar repositories for Thought-Cloning
Users that are interested in Thought-Cloning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Jun 22, 2023Updated 2 years ago
- ☆15Apr 26, 2025Updated 11 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆247Dec 11, 2025Updated 3 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆324Oct 22, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆414Jan 7, 2026Updated 2 months ago
- ☆25May 7, 2025Updated 10 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆963Nov 5, 2025Updated 4 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆61Jan 28, 2025Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,140Dec 23, 2023Updated 2 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft☆205Jun 4, 2024Updated last year
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆21Oct 6, 2023Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆227Jun 6, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- A simple wrapper for OpenAI to log input/outputs.☆106Aug 28, 2023Updated 2 years ago
- ☆23Oct 11, 2024Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Jun 29, 2023Updated 2 years ago
- ☆452Sep 27, 2023Updated 2 years ago
- ☆146May 2, 2024Updated last year
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- ☆222Jun 6, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 2 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Feb 27, 2024Updated 2 years ago
- Simple next-token-prediction for RLHF☆229Sep 30, 2023Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- ☆56Sep 9, 2023Updated 2 years ago
- ☆1,061May 29, 2023Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,414Nov 26, 2025Updated 4 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆28Jun 3, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Entailment self-training☆27May 30, 2023Updated 2 years ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆977Oct 22, 2024Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback☆135Nov 7, 2023Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,621Updated this week
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208May 24, 2023Updated 2 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆74Aug 31, 2024Updated last year