☆145 · May 2, 2024 · Updated last year
Alternatives and similar repositories for RAFA_code
Users who are interested in RAFA_code are comparing it to the repositories listed below
- The official implementation of Self-Exploring Language Models (SELM) ☆63 · Jun 4, 2024 · Updated last year
- Codes for Evolving Plastic ANNs ☆14 · Dec 18, 2022 · Updated 3 years ago
- ☆15 · Mar 26, 2024 · Updated last year
- Implementation of A Context-Integrated Transformer-Based Neural Network for Auction Design (ICML2022). ☆19 · Jun 30, 2022 · Updated 3 years ago
- Reinforcement Learning Assignment: Easy21 ☆12 · Jul 4, 2016 · Updated 9 years ago
- Website for HKU NLP group (under construction) ☆14 · Dec 23, 2025 · Updated 2 months ago
- [ICML 2024] Self-Infilling Code Generation ☆18 · May 5, 2024 · Updated last year
- ☆15 · Jul 9, 2025 · Updated 8 months ago
- A library for advanced large language model reasoning ☆2,338 · Jun 10, 2025 · Updated 9 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 · Oct 12, 2023 · Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆34 · Mar 29, 2023 · Updated 2 years ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback ☆125 · Mar 31, 2025 · Updated 11 months ago
- FireAct: Toward Language Agent Fine-tuning ☆292 · Oct 22, 2023 · Updated 2 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated) ☆22 · Oct 17, 2022 · Updated 3 years ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking ☆268 · Jun 28, 2024 · Updated last year
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration". ☆24 · Feb 9, 2023 · Updated 3 years ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆43 · Oct 1, 2024 · Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models" ☆820 · Jul 30, 2024 · Updated last year
- Reasoning with Language Model is Planning with World Model ☆188 · Aug 25, 2023 · Updated 2 years ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆160 · Oct 30, 2024 · Updated last year
- [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning ☆3,100 · Jan 14, 2025 · Updated last year
- ☆15 · Sep 7, 2022 · Updated 3 years ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ☆86 · May 21, 2025 · Updated 10 months ago
- Work in progress! I don't recommend looking at the code right now. ☆24 · Dec 3, 2025 · Updated 3 months ago
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization". ☆12 · Sep 11, 2022 · Updated 3 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆211 · Jul 31, 2023 · Updated 2 years ago
- Official Repo of LangSuitE ☆84 · Aug 15, 2024 · Updated last year
- ☆189 · Jan 27, 2025 · Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Jul 12, 2023 · Updated 2 years ago
- [ECCV2024] 🐙Octopus, an embodied vision-language model trained with RLEF, emerging superior in embodied visual planning and programming. ☆297 · May 20, 2024 · Updated last year
- ☆22 · Nov 11, 2024 · Updated last year
- Submission to the inverse scaling prize ☆23 · Jul 23, 2023 · Updated 2 years ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆66 · Jun 29, 2023 · Updated 2 years ago
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models" ☆90 · Jan 3, 2024 · Updated 2 years ago
- MPI Code Generation through Domain-Specific Language Models ☆15 · Nov 19, 2024 · Updated last year
- ☆124 · Feb 21, 2025 · Updated last year
- ☆44 · Sep 19, 2024 · Updated last year
- Hamiltonian neural network implementation for Henon Heiles dynamical system learning mix of order and chaos ☆11 · Dec 2, 2023 · Updated 2 years ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more! ☆21 · Oct 29, 2022 · Updated 3 years ago