A beamer template for LAMDA lab at NJU
☆16Oct 17, 2020Updated 5 years ago
Alternatives and similar repositories for LAMDA-Beamer-Template
Users that are interested in LAMDA-Beamer-Template are comparing it to the libraries listed below
Sorting:
- ☆15Sep 14, 2020Updated 5 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- ☆30Mar 1, 2022Updated 4 years ago
- ☆10Oct 15, 2020Updated 5 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- ☆12Sep 15, 2021Updated 4 years ago
- RLA is a tool for managing your RL experiments automatically☆72Feb 7, 2023Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- Toolkit of Causal Model-based Reinforcement Learning.☆33Jun 5, 2023Updated 2 years ago
- Re-implementations of SOTA RL algorithms.☆137Sep 7, 2023Updated 2 years ago
- RLA is a tool for managing your RL experiments automatically☆32Jan 11, 2025Updated last year
- ☆30Dec 22, 2022Updated 3 years ago
- PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.☆38Jun 22, 2022Updated 3 years ago
- A repo containing bash scripts to deploy reinforcement learning dev environment within one click!☆10May 15, 2025Updated 9 months ago
- A systematic design process for a self-organizing neuro-fuzzy Q-network for model-free and offline reinforcement learning.☆11May 29, 2023Updated 2 years ago
- ☆12Mar 17, 2024Updated last year
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluation a large set of candidate policies from a fixed dataset to select best ones.☆11Jun 2, 2022Updated 3 years ago
- NJU-IT侠聊天机器人☆10Dec 13, 2021Updated 4 years ago
- ☆12May 14, 2024Updated last year
- 🛠Robust SSH: auto-reconnect SSH session that preserves your running shell and command. Intuitive, no server-side setup, aimed at simplic…☆13Nov 14, 2025Updated 3 months ago
- ☆11Jun 21, 2022Updated 3 years ago
- Implementation of NeurIPS2021 paper <On Effective Scheduling of Model-based Reinforcement Learning>☆13Nov 16, 2021Updated 4 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Reinforcement learning training project for a SLG game☆13Dec 21, 2017Updated 8 years ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- [ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"☆14Apr 30, 2023Updated 2 years ago
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- ☆12Jun 30, 2022Updated 3 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- The code repository for "OmniEvalKit: A Modular, Lightweight Toolbox for Evaluating Large Language Model and its Omni-Extensions"☆14Feb 21, 2025Updated last year
- ☆12Jul 17, 2023Updated 2 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- Kuaishou Online RL Benchmark☆19Oct 21, 2023Updated 2 years ago
- 基于树莓派(Pi)和PyGame的魔镜(Mirror)☆18Aug 5, 2022Updated 3 years ago
- The implementation of AAAI 2022 paper "Multi-Agent Incentive Communication via Decentralized Teammate Modeling".☆63Dec 12, 2023Updated 2 years ago
- NJU-IT侠社团网站系统,包括预约和后台等等...☆16May 11, 2022Updated 3 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- ☆16May 4, 2021Updated 4 years ago
- Flow RL is a high-performance RL library with flow and diffusion models.☆28Updated this week