ShengranHu / Thought-CloningView external linksLinks
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
☆267Jun 28, 2024Updated last year
Alternatives and similar repositories for Thought-Cloning
Users that are interested in Thought-Cloning are comparing it to the libraries listed below
Sorting:
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆202Jun 22, 2023Updated 2 years ago
- ☆15Apr 26, 2025Updated 9 months ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆324Oct 22, 2024Updated last year
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆244Dec 11, 2025Updated 2 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆946Nov 5, 2025Updated 3 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆228Jun 6, 2023Updated 2 years ago
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,139Dec 23, 2023Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Jun 29, 2023Updated 2 years ago
- Code for "Learning to Model the World with Language." ICML 2024 Oral.☆413Jan 7, 2026Updated last month
- ☆144May 2, 2024Updated last year
- A simple wrapper for OpenAI to log input/outputs.☆106Aug 28, 2023Updated 2 years ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆94May 23, 2023Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 8 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated 11 months ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆220Jun 6, 2023Updated 2 years ago
- ☆21Oct 6, 2023Updated 2 years ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆780Oct 4, 2024Updated last year
- ☆449Sep 27, 2023Updated 2 years ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,327Nov 26, 2025Updated 2 months ago
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898☆240May 5, 2024Updated last year
- ☆25May 7, 2025Updated 9 months ago
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- Simple next-token-prediction for RLHF☆229Sep 30, 2023Updated 2 years ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆207May 24, 2023Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- ☆1,057May 29, 2023Updated 2 years ago
- ☆56Sep 9, 2023Updated 2 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- Entailment self-training☆26May 30, 2023Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 2 years ago
- Mixing Language Models with Self-Verification and Meta-Verification☆112Dec 12, 2024Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆976Oct 22, 2024Updated last year
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,592Dec 11, 2024Updated last year
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆270Apr 18, 2024Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- Symmetric Encryption with Language Models☆13Jun 13, 2023Updated 2 years ago
- ☆158Mar 18, 2023Updated 2 years ago