TomZahavy/CB_AE_DQN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TomZahavy/CB_AE_DQN)

TomZahavy / CB_AE_DQN

Contextual Bandits Action Elimination DQN

☆21

Alternatives and similar repositories for CB_AE_DQN

Users that are interested in CB_AE_DQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
View on GitHub
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆69Nov 28, 2019Updated 6 years ago
atavakol / action-hypergraph-networks
View on GitHub
(ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices
☆23Jun 22, 2021Updated 5 years ago
babylonhealth / decoding-decoders
View on GitHub
☆12Jul 14, 2022Updated 4 years ago
tomdbar / ecord
View on GitHub
Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".
☆13Jun 18, 2022Updated 4 years ago
sahandrez / homomorphic_policy_gradient
View on GitHub
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24Apr 8, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
KyriacosShiarli / taco
View on GitHub
☆25Jan 2, 2019Updated 7 years ago
VishaalMK / VectorDefense
View on GitHub
VectorDefense: Vectorization as a Defense to Adversarial Examples --->
☆13May 3, 2018Updated 8 years ago
edbeeching / 3d_control_deep_rl
View on GitHub
Baselines and memory-based scenarios for the ViZDoom simulator
☆36Dec 8, 2022Updated 3 years ago
Rowing0914 / Reinforcement_Learning
View on GitHub
Research repo of RL
☆23Mar 25, 2023Updated 3 years ago
lio-wong / llm-operators
View on GitHub
☆11Oct 29, 2024Updated last year
YashBansod / IPyHOP
View on GitHub
IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.
☆12Aug 12, 2022Updated 3 years ago
greentfrapp / doping
View on GitHub
Code for DOPING: Generative Data Augmentation for Unsupervised Anomaly Detection with GAN
☆15Aug 23, 2018Updated 7 years ago
michal-g / Notebooks-to-Packages
View on GitHub
course material for the "Notebooks to Scripts to Packages" workshop as part of Princeton Wintersession 2023
☆16Jan 17, 2024Updated 2 years ago
mjamroz / PlantRecognition
View on GitHub
Example of android app written in Qt/Qml which uses MXNet for plant image recognition.
☆10Nov 4, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
bokveizen / non-fragile-hypercore
View on GitHub
[DAMI'23 (ECMLPKDD'23 Journal Track)] Hypercore Decomposition for Non-Fragile Hyperedges: Concepts, Algorithms, Observations, and Applica…
☆11Feb 15, 2024Updated 2 years ago
anki08 / Option-Critic
View on GitHub
A simple option critic framework using Q-Learning
☆14Feb 7, 2022Updated 4 years ago
ChangyWen / wolpertinger_ddpg
View on GitHub
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆65Dec 7, 2022Updated 3 years ago
PrincetonUniversity / software_testing
View on GitHub
☆18Jun 23, 2026Updated last month
thanard / causal-infogan
View on GitHub
☆85May 29, 2019Updated 7 years ago
McGill-NLP / feedbackqa
View on GitHub
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12Jul 13, 2022Updated 4 years ago
punkcure / Iterative-GAN
View on GitHub
We propose a new variant GAN model to deal with image generation and transformation,especially in facial attributes area.
☆12Nov 16, 2017Updated 8 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Ktakuya332C / deepcube
View on GitHub
An implementation of the paper "Solving the Rubik's Cube without Human Knowledge"
☆14Dec 9, 2018Updated 7 years ago
BorealisAI / llm-pddl-planning
View on GitHub
☆18Feb 20, 2025Updated last year
brialorelle / kiddraw
View on GitHub
Project hosted at Stanford University examining developmental changes in children's drawings
☆23Sep 9, 2022Updated 3 years ago
Wizaron / binary-stochastic-neurons
View on GitHub
Binary Stochastic Neurons in PyTorch
☆57Jan 6, 2018Updated 8 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
View on GitHub
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆176Mar 1, 2018Updated 8 years ago
aravindsrinivas / upn
View on GitHub
☆33Jun 14, 2018Updated 8 years ago
allenai / faithful-nmn
View on GitHub
Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks
☆13Jun 12, 2023Updated 3 years ago
lil-lab / chalet
View on GitHub
Cornell House Agent Learning Environment
☆47Jun 22, 2022Updated 4 years ago
shenweichen / ReinforcementLearning
View on GitHub
This project contains several Deep Reinforcement Learning method and some experiments basd on OpenAi gym.
☆19Jan 28, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
daomingAU / MontezumaRevenge_SDRL
View on GitHub
☆17Feb 25, 2020Updated 6 years ago
prajjwal1 / rl_paradigm
View on GitHub
Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"
☆17Jan 31, 2024Updated 2 years ago
haotian-liu / transformers_llava
View on GitHub
☆16Apr 28, 2023Updated 3 years ago
openai / atari-demo
View on GitHub
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆33Nov 22, 2018Updated 7 years ago
ChunyuanLI / RAS
View on GitHub
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15Jan 21, 2019Updated 7 years ago
njustesen / a2c_gvgai
View on GitHub
A2C for GVG-AI
☆22Nov 7, 2018Updated 7 years ago
JINAN-xxx / gym_super_mario
View on GitHub
本项目旨在探索强化学习技术在经典游戏《超级玛丽》中的应用，通过训练一个智能代理来自主导航并完成游戏关卡。我们采用了深度Q网络（DQN）和双深度Q网络（DDQN）等先进的强化学习算法，结合神经网络，使得代理能够学习如何在游戏世界中生存并获得高分。项目特点强化学习实践：本…
☆18Mar 25, 2026Updated 4 months ago