Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
Alternatives and similar repositories for MAMBA
Users that are interested in MAMBA are comparing it to the libraries listed below
Sorting:
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer☆16Aug 15, 2022Updated 3 years ago
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Sep 11, 2023Updated 2 years ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".☆18May 1, 2022Updated 3 years ago
- Documentation related to Microsoft Cognitive Research Technologies☆21Oct 6, 2022Updated 3 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- ☆17Dec 30, 2024Updated last year
- Early Detection of Fake News with Multi-source Weak Social Supervision☆23Jun 12, 2023Updated 2 years ago
- My personal web page☆11Feb 17, 2026Updated last week
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”☆26Mar 6, 2023Updated 2 years ago
- Programming and data analysis advanced in R course in Spring 2022/23☆11Jun 20, 2023Updated 2 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- ☆24Nov 10, 2020Updated 5 years ago
- ☆30Dec 22, 2022Updated 3 years ago
- A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.☆26Jan 27, 2026Updated last month
- Program and links to the material for the GloBIAS Training School 2025, Kobe, Japan.☆22Oct 27, 2025Updated 4 months ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Feb 6, 2022Updated 4 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- MirMachine, a command line tool to detect microRNA homologs in genome sequences.☆13Dec 3, 2025Updated 2 months ago
- ☆33Aug 30, 2024Updated last year
- [READ ONLY] Subtree split of the siyuan-packages-monorepo (see https://github.com/Zuoqiu-Yingyi/siyuan-packages-monorepo)☆12Jan 23, 2024Updated 2 years ago
- toRpEDA package☆19Jun 20, 2023Updated 2 years ago
- A library for testing concurrent C++ code and deterministically reproducing bugs.☆44Sep 29, 2022Updated 3 years ago
- Add AI to the Linux terminal☆10Apr 28, 2024Updated last year
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Terraform Script for - Storage, container and data life cycle rules creation at scale☆11Jan 10, 2023Updated 3 years ago
- Documentation for the OSTC team☆16Apr 24, 2025Updated 10 months ago
- A repo containing bash scripts to deploy reinforcement learning dev environment within one click!☆10May 15, 2025Updated 9 months ago
- Research simulation toolkit for federated learning☆13Nov 7, 2020Updated 5 years ago
- Samples for partner application development (OEM, MO, IHV) for Window☆18Jun 12, 2023Updated 2 years ago
- Sample notebooks for Juno☆11Mar 1, 2025Updated 11 months ago
- ☆14Feb 11, 2026Updated 2 weeks ago
- Article for Special Edition of Information: Machine Learning with Python☆14Jan 8, 2025Updated last year