shivakanthsujit / reducible-lossView external linksLinks
Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss
☆12Oct 10, 2022Updated 3 years ago
Alternatives and similar repositories for reducible-loss
Users that are interested in reducible-loss are comparing it to the libraries listed below
Sorting:
- A cell counter using computer vision techniques.☆10May 13, 2022Updated 3 years ago
- ☆12Aug 6, 2024Updated last year
- Synthetic Experience Replay☆109May 27, 2024Updated last year
- This repo refers to paper Invariant Transform Experience Replay. And this repo is built on top of OpenAI Baseline. For more information p…☆12Feb 2, 2021Updated 5 years ago
- Code for the paper Multi-Armed Bandits with Correlated Arms☆10Jun 3, 2021Updated 4 years ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆34May 30, 2025Updated 8 months ago
- Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains☆10Nov 12, 2021Updated 4 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …☆11Jun 20, 2025Updated 7 months ago
- ☆10Nov 14, 2023Updated 2 years ago
- PyTorch implementation of the estimator proposed in the paper "Estimating Differential Entropy under Gaussian Convolutions"☆13Oct 22, 2020Updated 5 years ago
- Codes for "Quantitative Comparison of Reinforcement Learning and Data-driven Model Predictive Control for Chemical and Biological Process…☆12Dec 18, 2023Updated 2 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆21Jan 6, 2026Updated last month
- Code for Expert Supervised Reinforcement Learning☆10Apr 7, 2021Updated 4 years ago
- This repository contains code for the paper "Better Estimation of the KL Divergence Between Language Models"☆15May 30, 2025Updated 8 months ago
- ☆13Mar 1, 2025Updated 11 months ago
- ROCC: Reinforcement learning for the Optimisation of Co-Cultures☆13Nov 17, 2020Updated 5 years ago
- suPER is a collaborative multi-agent RL algorithm☆14Jun 11, 2024Updated last year
- Actor Prioritized Experience Replay☆18Nov 20, 2023Updated 2 years ago
- code of IJCAI submission "Soft Hindsight Experience Replay"☆13Mar 23, 2020Updated 5 years ago
- ☆52Jul 21, 2022Updated 3 years ago
- A Visualization Tool for GPU Occupancy on S Cluster.☆13Nov 16, 2022Updated 3 years ago
- Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)☆12Jun 13, 2023Updated 2 years ago
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated last year
- Cutting-edge platform for LLM agent tuning. Deliver RL tuning with flexibility, reliability, speed, multi-agent optimization and realtime…☆39Updated this week
- My Linux and Mac configuration☆11Oct 1, 2025Updated 4 months ago
- Quantum Walk Graph Classifier☆14Feb 8, 2023Updated 3 years ago
- [Ultra Fast&Powerful Diffusion RL] Reinforcing Diffusion Models by Direct Group Preference Optimization☆40Oct 11, 2025Updated 4 months ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆19Sep 14, 2025Updated 5 months ago
- This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"☆15Sep 21, 2023Updated 2 years ago
- Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"☆14May 24, 2021Updated 4 years ago
- ☆19Aug 4, 2025Updated 6 months ago
- Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)☆14Feb 20, 2023Updated 2 years ago
- Python wrapper for lean-gym☆12Apr 5, 2023Updated 2 years ago
- ☆16Mar 8, 2022Updated 3 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆15Jul 15, 2024Updated last year