[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
☆268 · Jun 28, 2024 · Updated last year
Alternatives and similar repositories for Thought-Cloning
Users who are interested in Thought-Cloning are comparing it to the libraries listed below.
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted) ☆33 · Sep 25, 2023 · Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning ☆202 · Jun 22, 2023 · Updated 2 years ago
- ☆15 · Apr 26, 2025 · Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs). ☆248 · Dec 11, 2025 · Updated 5 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks ☆326 · Oct 22, 2024 · Updated last year
- ☆27 · May 7, 2025 · Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w… ☆988 · Nov 5, 2025 · Updated 6 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆62 · Jan 28, 2025 · Updated last year
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models". ☆1,139 · Dec 23, 2023 · Updated 2 years ago
- STEVE-1: A Generative Model for Text-to-Behavior in Minecraft ☆211 · Jun 4, 2024 · Updated last year
- Codes for Evolving Plastic ANNs ☆14 · Dec 18, 2022 · Updated 3 years ago
- ☆21 · Oct 6, 2023 · Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆227 · Jun 6, 2023 · Updated 2 years ago
- Repo to reproduce the First-Explore paper results ☆39 · May 6, 2026 · Updated last week
- A simple wrapper for OpenAI to log input/outputs. ☆106 · Aug 28, 2023 · Updated 2 years ago
- ☆23 · Oct 11, 2024 · Updated last year
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment" ☆75 · May 20, 2025 · Updated 11 months ago
- ☆454 · Sep 27, 2023 · Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 · Jun 29, 2023 · Updated 2 years ago
- ☆147 · May 2, 2024 · Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale" ☆27 · Mar 30, 2023 · Updated 3 years ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆802 · Oct 4, 2024 · Updated last year
- ☆223 · Jun 6, 2023 · Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?" ☆94 · May 23, 2023 · Updated 2 years ago
- Simple next-token-prediction for RLHF ☆229 · Sep 30, 2023 · Updated 2 years ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- ☆56 · Sep 9, 2023 · Updated 2 years ago
- ☆1,061 · May 29, 2023 · Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆37 · Apr 3, 2023 · Updated 3 years ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ☆1,457 · Nov 26, 2025 · Updated 5 months ago
- Entailment self-training ☆27 · May 30, 2023 · Updated 2 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆28 · Jun 3, 2023 · Updated 2 years ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆982 · Oct 22, 2024 · Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 · Nov 7, 2023 · Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 · Jan 9, 2025 · Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models" ☆2,660 · Mar 24, 2026 · Updated last month
- Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback ☆208 · May 24, 2023 · Updated 2 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions ☆74 · Aug 31, 2024 · Updated last year
- ☆158 · Mar 18, 2023 · Updated 3 years ago