umair-nasir14 / LLMaticLinks

LLMatic is a 2-archive QD algorithm that uses LLMs to mutate the networks. Tested for Neural Architecture search but can easily be used for any domain.

☆14

Alternatives and similar repositories for LLMatic

Users that are interested in LLMatic are comparing it to the libraries listed below

Sorting:

dunnolab / awesome-in-context-rl
Awesome In-Context RL: A curated list of In-Context Reinforcement Learning - - —
☆201Updated last month
WindyLab / LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆449Updated 10 months ago
minaek / reward_design_with_llms
☆220Updated 2 years ago
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆267Updated 10 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆234Updated 8 months ago
balrog-ai / BALROG
Benchmarking Agentic LLM and VLM Reasoning On Games
☆166Updated 2 months ago
abdulhaim / LMRL-Gym
☆98Updated last year
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆140Updated last year
floodsung / LLM-with-RL-papers
A collection of LLM with RL papers
☆276Updated last year
jhejna / cpl
Code for Contrastive Preference Learning (CPL)
☆173Updated 7 months ago
csmile-1006 / PreferenceTransformer
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆163Updated last year
snu-mllab / DPPO
Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)
☆42Updated 11 months ago
YifeiZhou02 / ArCHer
Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"
☆181Updated 3 months ago
facebookresearch / online-dt
Online Decision Transformer
☆262Updated last year
nicoladainese96 / code-world-models
Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24.
☆11Updated 4 months ago
todexter3 / Richelieu
☆14Updated 9 months ago
cooperativex / SocialJax
SocialJax: sequential social dilemma environments
☆41Updated last month
jinpz / q_sharp
The official code release for Q#: Provably Optimal Distributional RL for LLM Post-Training
☆15Updated 4 months ago
beanie00 / Decision-ConvFormer
[ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"
☆12Updated last year
Holmeswww / SPRING
☆14Updated last year
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆277Updated 3 years ago
openrlbenchmark / openrlbenchmark
☆234Updated 7 months ago
yingchengyang / Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
☆449Updated 3 months ago
BladeTransformerLLC / OvercookedGPT
An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…
☆69Updated 2 years ago
yuqingd / ellm
☆79Updated last year
123penny123 / Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
☆372Updated last year
agentification / RAFA_code
☆143Updated last year
AGI-Edgerunners / LLM-Optimizers-Papers
Must-read Papers on Large Language Model (LLM) as Optimizers and Automatic Optimization for Prompting LLMs.
☆244Updated last year
mohmdelsayed / streaming-drl
Deep reinforcement learning without experience replay, target networks, or batch updates.
☆256Updated 3 months ago
hammer-wang / Awesome-Transformers-for-Sequential-Decision-Making
Tracking literature and additional online resources on transformers for sequential decision making including RL and beyond.
☆47Updated 2 years ago