shivakanthsujit/reducible-loss

Codebase for Prioritizing samples in Reinforcement Learning with Reducible Loss

☆12

Alternatives and similar repositories for reducible-loss

Users who are interested in reducible-loss are comparing it to the libraries listed below.

baturaysaglam / actor-prioritized-exp-replay
Actor Prioritized Experience Replay
☆19 · Nov 20, 2023 · Updated 2 years ago
pierrelux / rlss2017
☆17 · Jul 3, 2017 · Updated 9 years ago
Nikunj-Gupta / conformal-agent-modelling
CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning
☆15 · Jun 24, 2024 · Updated 2 years ago
conglu1997 / SynthER
Synthetic Experience Replay
☆114 · Apr 16, 2026 · Updated 3 months ago
martius-lab / GateL0RD-paper
Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
☆11 · Nov 12, 2021 · Updated 4 years ago
brightjade / PRiSM
Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …
☆11 · Jun 20, 2025 · Updated last year
brewinn / Roadrunner-CellCounter
A cell counter using computer vision techniques.
☆10 · May 13, 2022 · Updated 4 years ago
dojeon-ai / SimTPR
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12 · Jun 13, 2023 · Updated 3 years ago
sen-ye / linux-clash
☆10 · Nov 14, 2023 · Updated 2 years ago
danijar / dotfiles
Mac and Linux config
☆11 · Jul 20, 2026 · Updated last week
mgerstgrasser / super
suPER is a collaborative multi-agent RL algorithm
☆14 · Jun 11, 2024 · Updated 2 years ago
DIYer22 / sddn
Core Library of Discrete Distribution Networks (ICLR 2025)
☆15 · Oct 12, 2025 · Updated 9 months ago
birlrobotics / ITER_KER_GER
This repo refers to paper Invariant Transform Experience Replay. And this repo is built on top of OpenAI Baseline. For more information p…
☆12 · Feb 2, 2021 · Updated 5 years ago
CHUENGMINCHOU / AW-PER-A2C
The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …
☆10 · Aug 7, 2022 · Updated 3 years ago
yevvonlim / kai-presentation
Claude Code skill for KAI presentation design in HTML
☆16 · Mar 20, 2026 · Updated 4 months ago
snu-mllab / DCPG
Official PyTorch implementation of "Rethinking Value Function Learning for Generalization in Reinforcement Learning" (NeurIPS 2022)
☆15 · Feb 20, 2023 · Updated 3 years ago
rozenk30 / Quantitative-Comparison-of-RL-and-MPC
Codes for "Quantitative Comparison of Reinforcement Learning and Data-driven Model Predictive Control for Chemical and Biological Process…
☆12 · Dec 18, 2023 · Updated 2 years ago
manish-pra / copg
This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.
☆24 · Aug 1, 2020 · Updated 5 years ago
huxiao09 / QPA
☆13 · Sep 24, 2024 · Updated last year
ling-pan / OMAR
☆55 · Jul 21, 2022 · Updated 4 years ago
ambujtewari / stats701-winter2021
Theory of Reinforcement Learning
☆18 · Apr 20, 2021 · Updated 5 years ago
FSLight1996 / SHER
code of IJCAI submission "Soft Hindsight Experience Replay"
☆13 · Mar 23, 2020 · Updated 6 years ago
saebrahimi / Emotion-Recognition-RNN
Recurrent Neural Networks for Emotion Recognition in Video
☆86 · Jan 7, 2017 · Updated 9 years ago
vmichals / FigureQA-baseline
TensorFlow implementation of the CNN-LSTM, Relation Network and text-only baselines for the paper "FigureQA: An Annotated Figure Dataset …
☆35 · Feb 22, 2018 · Updated 8 years ago
veugene / data_tools
High performance data loading, preprocessing, or preparation for deep learning.
☆14 · Mar 12, 2021 · Updated 5 years ago
ZibinDong / cocos
Official implementation of the paper "Conditioning Matters: Training Diffusion Policies is Faster Than You Think".
☆18 · May 19, 2025 · Updated last year
guosyjlu / OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
☆16 · Aug 14, 2023 · Updated 2 years ago
shivakanthsujit / VAE-PyTorch
Variational Autoencoders trained on the SVHN and FashionMNIST data-sets implemented in PyTorch
☆30 · Oct 3, 2023 · Updated 2 years ago
gemcollector / PIE-G
This is the repo of NeurIPS 2022 paper: "Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning"
☆16 · Sep 21, 2023 · Updated 2 years ago
AIDefender / MyDiscor
Unofficial Code for NeurIPS 2021 paper "Regret Minimization Experience Replay in Off-policy Reinforcement Learning"
☆14 · May 24, 2021 · Updated 5 years ago
SNU-EPEL / Integration-of-Reinforcement-Learning-and-Model-Predictive-Control-to-Optimize-Semi-batch-Bioreactor
☆16 · Mar 8, 2022 · Updated 4 years ago
semitable / seac
The official code base of Shared Experience Actor-Critic (NeurIPS2020)
☆26 · Feb 23, 2024 · Updated 2 years ago
veloxml / VeloxML
☆13 · Mar 1, 2025 · Updated last year
DelinQu / pj-probe
A Visualization Tool for GPU Occupancy on S Cluster.
☆13 · Nov 16, 2022 · Updated 3 years ago
Yu-Maryland / Verilog-to-PyG
☆31 · Apr 23, 2024 · Updated 2 years ago
Intelligent-Driving-Laboratory / Reinforcement-Learning-for-Sequential-Decision-and-Optimal-Control
Source Code for "Reinforcement Learning for Sequential Decision and Optimal Control" by Shengbo Eben Li
☆16 · Dec 21, 2023 · Updated 2 years ago
TsinghuaC3I / LLM4BioHypoGen
[COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
☆15 · Jul 15, 2024 · Updated 2 years ago
huawei-csl / spire-hdl
Spire is a Python embedded domain-specific language (DSL) for RTL generation. Its built-in optimizations reduce area and delay of circuit…
☆24 · Updated this week
zhangir-azerbayev / MetaMath
☆11 · Oct 11, 2023 · Updated 2 years ago