Imitation learning from multiple experts
☆13Aug 29, 2022Updated 3 years ago
Alternatives and similar repositories for MAMBA
Users that are interested in MAMBA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for paper: Efficient deep reinforcement learning via adaptive policy transfer☆16Aug 15, 2022Updated 3 years ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Documentation related to Microsoft Cognitive Research Technologies☆21Oct 6, 2022Updated 3 years ago
- ☆30Jun 4, 2022Updated 3 years ago
- Early Detection of Fake News with Multi-source Weak Social Supervision☆24Jun 12, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A lightweight reimplementation of Adversarially Trained Actor Critic☆20Mar 19, 2026Updated last month
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning☆20Dec 30, 2022Updated 3 years ago
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated last year
- Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''☆41Oct 25, 2022Updated 3 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- BANG is a new pretraining model to Bridge the gap between Autoregressive (AR) and Non-autoregressive (NAR) Generation. AR and NAR generat…☆28Feb 6, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".☆18May 1, 2022Updated 3 years ago
- Python code : Clustering by fast search and find of density peaks☆10Oct 13, 2017Updated 8 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- aicreator for aidata☆14May 17, 2023Updated 2 years ago
- Hands-on with popular deep learning datasets and tasks☆13Apr 4, 2023Updated 3 years ago
- Code for "PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection"☆26Apr 21, 2026Updated last week
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration ”☆26Mar 6, 2023Updated 3 years ago
- I have targeted to solve the benchmark problem in Reinforcement learning literature using Deep Q-networks with images as the only input t…☆12Dec 2, 2019Updated 6 years ago
- ☆17Dec 30, 2024Updated last year
- REOBench: Benchmarking Robustness of Earth Observation Foundation Models☆24Oct 28, 2025Updated 6 months ago
- ☆16Mar 6, 2025Updated last year
- Task dependent skill transformation is challenging due to the ignorance of the relationships between primitive skills. In this project, w…☆14Jun 4, 2020Updated 5 years ago
- ☆24Nov 10, 2020Updated 5 years ago
- The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"☆18Jul 20, 2023Updated 2 years ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for 'Mapping State Space using Landmarks for Universal Goal Reaching'.☆16Dec 26, 2023Updated 2 years ago
- Cost-effective and scalable LiDAR simulation by factoring the real world.☆12Dec 8, 2025Updated 4 months ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Runtime for deep learning workload☆21May 24, 2022Updated 3 years ago
- Edutainment game teaching players concepts around machine learning☆15Feb 18, 2020Updated 6 years ago
- [ICCV25] Official implementation of the paper HoliTracer.☆44Apr 7, 2026Updated 3 weeks ago
- Hyperparameter Tuning for Deep Learning☆16Feb 5, 2020Updated 6 years ago