yuqingd/ellm

yuqingd / ellm

☆91

Alternatives and similar repositories for ellm

Users that are interested in ellm are comparing it to the libraries listed below.

ZJLAB-AMMI / LLM4Teach
Python code to implement LLM4Teach, a policy distillation approach for teaching reinforcement learning agents with Large Language Model
☆54 • Apr 19, 2024 • Updated 2 years ago
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆82 • Jan 11, 2025 • Updated last year
flowersteam / Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
☆276 • Oct 27, 2025 • Updated 8 months ago
frankroeder / lanro-gym
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14 • Jan 27, 2026 • Updated 5 months ago
DanDoge / Palm
team Doggeee's solution to Ego4D LTA challenge@CVPRW23'
☆14 • Nov 4, 2023 • Updated 2 years ago
abdulhaim / LMRL-Gym
☆116 • Jul 2, 2024 • Updated 2 years ago
microsoft / SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …
☆146 • Apr 11, 2024 • Updated 2 years ago
danijar / crafter
Benchmarking the Spectrum of Agent Capabilities
☆578 • Jan 23, 2024 • Updated 2 years ago
liyheng / FOP
☆14 • Jul 12, 2021 • Updated 5 years ago
PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆19 • Aug 9, 2024 • Updated last year
burchim / TWISTER
[ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)
☆57 • Mar 9, 2025 • Updated last year
benellis3 / pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆19 • Aug 20, 2023 • Updated 2 years ago
WindyLab / LLM-RL-Papers
Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.
☆558 • Nov 17, 2025 • Updated 8 months ago
ChanLiang / acl-emnlp-poster-templates
Templates and examples for ACL and EMNLP conference posters.
☆15 • Oct 5, 2024 • Updated last year
argmax-ai / aime
Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"
☆13 • Dec 4, 2023 • Updated 2 years ago
princeton-nlp / lwm
We develop world models that can be adapted with natural language. Integrating these models into artificial agents allows humans to effe…
☆25 • Feb 10, 2024 • Updated 2 years ago
ethz-mrl / VidBot
[CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
☆52 • Apr 10, 2026 • Updated 3 months ago
LAMDA-RL / ODIS
The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
☆45 • Oct 31, 2024 • Updated last year
xlang-ai / text2reward
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
☆210 • Dec 17, 2024 • Updated last year
ReinholdM / Offline-Pre-trained-Multi-Agent-Decision-Transformer
☆120 • Apr 15, 2023 • Updated 3 years ago
dojeon-ai / Atari-PB
Official repository for "Investigating Pre-Training Objectives for Generalization in Visual Reinforcement Learning" (ICML 2024)
☆11 • Sep 16, 2025 • Updated 10 months ago
StavrosOrf / DT4EVs
A Decision Transformer for solving optimal EV charging problems using offline data.
☆21 • Jan 19, 2026 • Updated 6 months ago
omron-sinicx / action-constrained-RL-benchmark
☆28 • Apr 26, 2024 • Updated 2 years ago
samjia2000 / HSP
This is a repository for Hidden-utility Self-Play.
☆27 • Jul 27, 2023 • Updated 2 years ago
zisikons / deep-rl
Deep Learning (FS 2020)
☆17 • Oct 10, 2022 • Updated 3 years ago
hanggao-gh / InteractiveMemorySharingLLM
☆22 • Oct 12, 2024 • Updated last year
ahu-bioinf-lab / AMGDTI
A Network Integration Approach for Drug-Target Interaction Prediction
☆13 • Apr 5, 2025 • Updated last year
Wen2chao / RL-Algorithm
Hello😜
☆29 • Nov 8, 2020 • Updated 5 years ago
Aaron617 / AgentGen
[KDD 2025] AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation
☆34 • Nov 18, 2025 • Updated 8 months ago
sogno-platform / proloaf
A Probabilistic Load Forecasting Project. Mirror of https://git.rwth-aachen.de/acs/public/automation/plf/proloaf
☆16 • Apr 2, 2026 • Updated 3 months ago
flowersteam / EAGER
☆10 • Oct 11, 2022 • Updated 3 years ago
GuanSuns / LLMs-World-Models-for-Planning
The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…
☆108 • Aug 11, 2024 • Updated last year
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆249 • Dec 11, 2025 • Updated 7 months ago
mahaitongdae / Feasible-Actor-Critic
Code for paper Feasible Actor-Critic: Constrained Reinforcement Learning for Ensuring Statewise Safety.
☆20 • May 22, 2022 • Updated 4 years ago
HongyeGuo / DIRL-bidding_preference
The code of the algorithm proposed in the paper "Deep Inverse Reinforcement Learning for Objective Function Identification in Bidding Mod…
☆15 • Aug 13, 2021 • Updated 4 years ago
nsidn98 / LLaMAR
Code for our paper LLaMAR: LM-based Long-Horizon Planner for Multi-Agent Robotics
☆41 • Feb 10, 2025 • Updated last year
Visual-AI / GAMEBoT
[ACL 2025] GAMEBoT: Transparent Assessment of LLM Reasoning in Games
☆33 • May 15, 2026 • Updated 2 months ago
wutaiqiang / awesome-GNN2MLP-distillation
Learning MLPs to replace GNN
☆10 • Jun 3, 2023 • Updated 3 years ago
j3soon / dfac
[ICML 2021] DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
☆31 • Jun 1, 2023 • Updated 3 years ago