umair-nasir14 / LLMatic
LLMatic is a 2-archive quality-diversity (QD) algorithm that uses LLMs to mutate networks. It has been tested on neural architecture search but can easily be used in any domain.
☆14Updated 11 months ago
Alternatives and similar repositories for LLMatic
Users who are interested in LLMatic are comparing it to the libraries listed below:
- Awesome In-Context RL: A curated list of In-Context Reinforcement Learning☆201Updated last month
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆449Updated 10 months ago
- ☆220Updated 2 years ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆267Updated 10 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆234Updated 8 months ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆166Updated 2 months ago
- ☆98Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆140Updated last year
- A collection of LLM with RL papers☆276Updated last year
- Code for Contrastive Preference Learning (CPL)☆173Updated 7 months ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)☆163Updated last year
- Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)☆42Updated 11 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆181Updated 3 months ago
- Online Decision Transformer☆262Updated last year
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.☆11Updated 4 months ago
- ☆14Updated 9 months ago
- SocialJax: sequential social dilemma environments☆41Updated last month
- The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training☆15Updated 4 months ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Updated last year
- ☆14Updated last year
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…☆277Updated 3 years ago
- ☆234Updated 7 months ago
- Related papers for reinforcement learning, including classic papers and latest papers in top conferences☆449Updated 3 months ago
- An OpenAI gym environment to evaluate the ability of LLMs (e.g. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆69Updated 2 years ago
- ☆79Updated last year
- A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.☆372Updated last year
- ☆143Updated last year
- Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.☆244Updated last year
- Deep reinforcement learning without experience replay, target networks, or batch updates.☆256Updated 3 months ago
- Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.☆47Updated 2 years ago