Pytorch Implementation of the Distributed SAC. Test environment is LunarLanderContinuous-v2 and Metaworld MT1, MT10
☆12Apr 6, 2022Updated 3 years ago
Alternatives and similar repositories for Distributed_SAC
Users that are interested in Distributed_SAC are comparing it to the libraries listed below
Sorting:
- [AAAI 2024 (Oral)] Safety-MuJoCo Environments.☆11Jun 4, 2024Updated last year
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- how to build a sentence embedding application using BentoML☆14Mar 31, 2025Updated 11 months ago
- [ECCV 2022] "TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information" by…☆10Sep 21, 2022Updated 3 years ago
- [CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification☆11Jul 12, 2024Updated last year
- ☆10Nov 27, 2019Updated 6 years ago
- ☆16Apr 28, 2023Updated 2 years ago
- deep learning based ovarian cancer segmentation☆17Nov 29, 2024Updated last year
- Planet wars RTS game for AI agent evaluation☆19Feb 17, 2026Updated 2 weeks ago
- This is MPE-pytorch, fix some bugs.☆10Apr 26, 2020Updated 5 years ago
- Instantly fix problems with ChatGPT AI. Use ChatGPT and GPT-4 AI tools to find one-click 'lightbulb menu' solutions to problems in your c…☆12Mar 26, 2023Updated 2 years ago
- DuBE: Duple-balanced Ensemble Learning from Skewed Data☆11Oct 31, 2022Updated 3 years ago
- PyTorch implementation of "Wasserstein Iterative Networks for Barycenter Estimation" (NeurIPS 2022)☆20Jul 3, 2023Updated 2 years ago
- Pytorch implementation of gradCAM, guidedBackProp, smoothGrad☆13Mar 5, 2019Updated 6 years ago
- Official implementation of "On the Effectiveness of Out-of-Distribution Data in Self-Supervised Long-Tail Learning" (ICLR 2023)☆15Jul 15, 2023Updated 2 years ago
- Codes for 'Learning Probabilistic Topological Representations Using Discrete Morse Theory'☆14Sep 19, 2023Updated 2 years ago
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- TNT-KID: Transformer-based Neural Tagger for Keyword Identification☆11Jul 25, 2024Updated last year
- Collection of my Reinforcement Learning (RL) practices including DQN, D3QN, and Adaptive Gamma, applied to the Lunar Lander and CartPole …☆16Oct 21, 2024Updated last year
- On Simple Reactive Neural Networks for Behaviour-Based Reinforcement Learning by Ameya Pore and Gerardo Aragon-Camarasa☆10Jan 28, 2020Updated 6 years ago
- No more bad words in your console logs!☆10Feb 8, 2019Updated 7 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective.☆11Jun 26, 2021Updated 4 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024☆13Apr 21, 2024Updated last year
- An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"☆13Dec 9, 2023Updated 2 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆11Jul 4, 2018Updated 7 years ago
- Data Guy Story commandline☆11Dec 2, 2022Updated 3 years ago
- Fault-Tolerant Neural CBF☆14Feb 23, 2024Updated 2 years ago
- Nonlinear MPC implemetation for both kinematic and dynamic omni directional model☆28Jan 2, 2022Updated 4 years ago
- Model Predictive Control (MPC) for kinematic bicycle model☆12Mar 2, 2023Updated 3 years ago
- Passivity-based manipulator control with guaranteed constraint satisfaction in Drake☆18Apr 24, 2023Updated 2 years ago
- A preliminary platform for up to 1 million reinforcement learning agents☆11Aug 27, 2017Updated 8 years ago
- Deep Learning (FS 2020)☆17Oct 10, 2022Updated 3 years ago
- Solution to Kaggle's Google Research Football Competition☆14Dec 2, 2020Updated 5 years ago
- Testing different RL algorithms for multi-agent environments. From SARSA, QLearning to Independent Q-Learning, Joint Action Learning and …☆12Mar 29, 2019Updated 6 years ago
- Archive of Tasks and Results of the Video Browser Showdown☆13Feb 2, 2026Updated last month
- ☆27Oct 30, 2025Updated 4 months ago
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model.☆15Jan 3, 2024Updated 2 years ago
- A Framework for Comparing N Hyperparameter Optimizers on M Benchmarks.☆19Feb 22, 2026Updated last week