Based on the learnware paradigm, the learnware package supports the entire learnware workflow: submission, usability testing, organization, identification, deployment, and reuse. The repository also serves as the engine of Beimingwu, supporting its core functionalities.
☆109May 27, 2025Updated 9 months ago
Alternatives and similar repositories for Learnware
Users who are interested in Learnware are comparing it to the libraries listed below
- Beimingwu is the first systematic open-source implementation of the learnware dock system, providing a preliminary research platform for …☆122Jul 17, 2024Updated last year
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated last year
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- Imitation learning from multiple experts☆13Aug 29, 2022Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆31Jul 4, 2024Updated last year
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆27Jan 27, 2026Updated last month
- ☆30Dec 22, 2022Updated 3 years ago
- A Holistic Embodied Cognition Benchmark☆18Apr 3, 2025Updated 11 months ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- ☆24Oct 26, 2021Updated 4 years ago
- Quadra: Effortless and reproducible deep learning workflows with configuration files.☆50Feb 23, 2026Updated 3 weeks ago
- Fast dataset format and loader☆24Mar 6, 2026Updated last week
- Re-implementation of the offline model-based RL algorithm MOPO in PyTorch☆25Feb 28, 2022Updated 4 years ago
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 3 years ago
- A Python module designed for agile RL algorithm development.☆26Jul 11, 2024Updated last year
- Implementation of SAC and TD3 based on various RNNs and Transformers.☆28Sep 28, 2024Updated last year
- Learning records for building a large language model from scratch☆59Jan 1, 2025Updated last year
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- ☆24Nov 10, 2020Updated 5 years ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆29Feb 21, 2022Updated 4 years ago
- RLA is a tool for managing your RL experiments automatically☆32Jan 11, 2025Updated last year
- ☆45Jun 10, 2025Updated 9 months ago
- ☆30Mar 1, 2022Updated 4 years ago
- Benchmarked implementations of Offline RL Algorithms.☆77Mar 4, 2025Updated last year
- 南哪充电: a monitoring system for on-campus charging stations at Nanjing University☆35Feb 28, 2026Updated 2 weeks ago
- GitHub Actions: complete the daily health report check-in, so easy☆66Mar 24, 2022Updated 3 years ago
- NJUAI-Master-Courses☆29Aug 20, 2023Updated 2 years ago
- Experimental paper writing linter.☆35Sep 2, 2024Updated last year
- RLA is a tool for managing your RL experiments automatically☆72Feb 7, 2023Updated 3 years ago
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control. This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- ☆33Aug 30, 2024Updated last year
- Stream live plots to a matplotlib figure☆81Apr 18, 2025Updated 10 months ago
- Author's PyTorch implementation of TD7 for online and offline RL☆162Sep 12, 2023Updated 2 years ago
- ☆43May 25, 2023Updated 2 years ago