thanhkaist / mimo_Q_network
Implementation of Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
☆31 · Dec 9, 2025 · Updated 2 months ago
Alternatives and similar repositories for mimo_Q_network
Users who are interested in mimo_Q_network are comparing it to the libraries listed below.
- CCFDM reinforcement learning ☆40 · Dec 28, 2021 · Updated 4 years ago
- [ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization" ☆18 · Dec 22, 2025 · Updated last month
- Predictive Coding for Decision Transformer (IROS 2024) ☆41 · Jun 19, 2025 · Updated 7 months ago
- Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models (ICML 2025) ☆53 · Dec 26, 2025 · Updated last month
- Dual-scale Doppler Attention for Human Identification ☆47 · Aug 13, 2025 · Updated 6 months ago
- (ICCV2025) Occlusion-robust Stylization for Drawing-based 3D Animation ☆50 · Dec 26, 2025 · Updated last month
- Policy Learning from Large Vision-Language Model Feedback Without Reward Modeling (IROS 2025) ☆36 · Dec 26, 2025 · Updated last month
- [ICLR'25] MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation ☆38 · Dec 25, 2025 · Updated last month
- ☆39 · Dec 21, 2024 · Updated last year
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (… ☆49 · Nov 19, 2024 · Updated last year
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure ☆40 · Jun 12, 2024 · Updated last year
- ☆33 · Nov 26, 2024 · Updated last year
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet) ☆44 · Mar 12, 2024 · Updated last year
- [CVPR 2025] ITA-MDT official implementation ☆67 · Dec 21, 2025 · Updated last month
- [ECCV'22] SQuiDNet: Selective Query-guided Debiasing Network for Video Corpus Moment Retrieval ☆73 · Nov 23, 2022 · Updated 3 years ago
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024) ☆70 · Aug 23, 2025 · Updated 5 months ago
- Video-based AI dialogue system ☆14 · Dec 23, 2023 · Updated 2 years ago
- Multimodal_AI_Video_Dialogue ☆16 · Dec 3, 2024 · Updated last year
- Code for AAAI 2021 paper "SCNet: Training Inference Sample Consistency for Instance Segmentation". ☆22 · Jan 31, 2021 · Updated 5 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback" ☆12 · Dec 6, 2024 · Updated last year
- Code repository for FreGrad ☆52 · May 19, 2024 · Updated last year
- Implement mean shift cluster from numpy + sklearn + GPU-pytorch ☆10 · Apr 19, 2023 · Updated 2 years ago
- Python3 ROS Interface to Rethink Sawyer Robots with OpenAI Gym Compatibility ☆62 · Apr 13, 2019 · Updated 6 years ago
- MImE - Manipulation Imitation Environments ☆14 · Feb 1, 2022 · Updated 4 years ago
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion" ☆22 · Jun 9, 2024 · Updated last year
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ☆28 · Feb 21, 2022 · Updated 3 years ago
- Official code for "SNeRL: Semantic-aware Neural Radiance Fields for Reinforcement Learning" (ICML 2023) ☆32 · Dec 21, 2023 · Updated 2 years ago
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di… ☆62 · Nov 7, 2024 · Updated last year
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback ☆76 · Sep 12, 2024 · Updated last year
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆86 · Jun 12, 2022 · Updated 3 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762) ☆101 · Jan 4, 2021 · Updated 5 years ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆199 · Dec 16, 2023 · Updated 2 years ago
- Multitask Environments for RL ☆281 · Aug 23, 2021 · Updated 4 years ago
- ☆359 · Oct 12, 2022 · Updated 3 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning" ☆407 · Oct 7, 2023 · Updated 2 years ago
- A large-scale benchmark and learning environment. ☆1,686 · Jan 25, 2025 · Updated last year
- A PyTorch Platform for Distributed RL ☆752 · Sep 15, 2021 · Updated 4 years ago
- [NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds. ☆866 · Aug 12, 2024 · Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆944 · Feb 16, 2025 · Updated 11 months ago