microsoft/lightATAC

A lightweight reimplementation of Adversarially Trained Actor Critic

☆19

Alternatives and similar repositories for lightATAC

Users who are interested in lightATAC are comparing it to the repositories listed below.

microsoft / ATAC
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74 · Feb 2, 2023 · Updated 3 years ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
microsoft / MAMBA
Imitation learning from multiple experts
☆13 · Aug 29, 2022 · Updated 3 years ago
hwang-ua / inac_pytorch
☆20 · Jun 25, 2023 · Updated 3 years ago
microsoft / EMNLP2019-Split-And-Recombine
The code of EMNLP 2019 paper "A Split-and-Recombine Approach for Follow-up Query Analysis"
☆18 · Jul 20, 2023 · Updated 3 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28 · Dec 7, 2021 · Updated 4 years ago
LAMDA-RL / PRDC
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18 · Nov 8, 2024 · Updated last year
jiangsy / slbo_pytorch
☆15 · Sep 14, 2020 · Updated 5 years ago
dmksjfl / MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆63 · Apr 29, 2024 · Updated 2 years ago
Div-Infinity / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆88 · Feb 14, 2023 · Updated 3 years ago
tedmoskovitz / TOP
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25 · Jul 18, 2023 · Updated 3 years ago
ryanxhr / IVR
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46 · Jul 27, 2023 · Updated 2 years ago
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆43 · Jul 16, 2023 · Updated 3 years ago
tigerneil / reinforcementlearning.today
Made for a reading group at the Center for Safe AGI.
☆12 · Feb 23, 2026 · Updated 4 months ago
shidilrzf / Anti-exploration-RL
Anti-exploration in offline reinforcement learning
☆11 · May 17, 2021 · Updated 5 years ago
microsoft / MazeExplorer
Customisable 3D environment for assessing generalisation in Reinforcement Learning.
☆72 · Jun 12, 2023 · Updated 3 years ago
microsoft / platformer-ml-game
Edutainment game teaching players concepts around machine learning
☆15 · Feb 18, 2020 · Updated 6 years ago
microsoft / BuildAnIntelligentBot
This is the sample of the Talk to My Bot implementation of a smart bot that can interact with other bots.
☆25 · Jun 27, 2023 · Updated 3 years ago
microsoft / aicreator
aicreator for aidata
☆14 · May 17, 2023 · Updated 3 years ago
microsoft / fnl_paper
Factorized Neural Layers
☆31 · Jul 11, 2023 · Updated 3 years ago
microsoft / NTT
Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]
☆14 · Jul 17, 2025 · Updated last year
microsoft / deepnmt
☆31 · Jun 28, 2022 · Updated 4 years ago
kingdy2002 / VCSE
☆18 · Jun 8, 2023 · Updated 3 years ago
braraki / logical-options-framework
☆10 · Jun 7, 2021 · Updated 5 years ago
david-lindner / safe-grid-gym
A gym interface for AI safety gridworlds created in pycolab.
☆18 · May 12, 2022 · Updated 4 years ago
rr-learning / trifinger_rl_datasets
A Python package for loading robotics datasets which were recorded on the TriFinger platform. Also contains simulated gym environments th…
☆17 · Jan 17, 2024 · Updated 2 years ago
microsoft / data-science-examples
Quick useful examples of data science & ML & big data
☆16 · Jun 12, 2023 · Updated 3 years ago
Dragon-Zhuang / BPPO
Author's PyTorch implementation of the ICLR 2023 paper Behavior Proximal Policy Optimization (BPPO).
☆94 · Dec 13, 2023 · Updated 2 years ago
jity16 / When-to-Update-Your-Model-Constrained-Model-based-Reinforcement-Learning
Official PyTorch implementation of CMLO in the paper "When to Update Your Model: Constrained Model-based Reinforcement Learning"
☆10 · Nov 2, 2023 · Updated 2 years ago
microsoft / MetaST
☆26 · Jul 25, 2023 · Updated 2 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆48 · Sep 20, 2023 · Updated 2 years ago
davidbrandfonbrener / onestep-rl
☆44 · Sep 19, 2021 · Updated 4 years ago
microsoft / openpai-runtime
Runtime for deep learning workloads
☆21 · May 24, 2022 · Updated 4 years ago
lantunes / mountain-car-continuous
Implementations of solutions to the continuous mountain car problem, using OpenAI Gym and TensorFlow 1.1.
☆11 · Jan 29, 2018 · Updated 8 years ago
JiahangOK / MEMC_course
THU Mathematics for Engineering Master Candidates (Tsinghua University mathematics course for engineering master's students).
☆11 · Nov 21, 2021 · Updated 4 years ago
sail-sg / PatchAIL
Implementation of PatchAIL in the ICLR 2023 paper "Visual Imitation with Patch Rewards"
☆14 · Feb 15, 2023 · Updated 3 years ago
tarod13 / laplacian_dual_dynamics
Dual optimization to learn Laplacian eigenpairs in arbitrary spaces
☆18 · Dec 18, 2024 · Updated last year
Lunj12 / RL-Bandits-with-Knapsacks
Dynamic Pricing BwK Problem and Reinforcement Learning
☆31 · Dec 11, 2018 · Updated 7 years ago
nicklashansen / svea-vit
Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"
☆19 · Jul 11, 2023 · Updated 3 years ago