Bayesian Soft Actor Critic
☆16Jan 6, 2023Updated 3 years ago
Alternatives and similar repositories for bsac
Users that are interested in bsac are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Adopting reasonable strategies is challenging but crucial for an intelligent agent with limited resources working in hazardous, unstructu…☆13Dec 28, 2022Updated 3 years ago
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- Revisiting Discrete Soft Actor-Critic Accepted by Transactions on Machine Learning Research (TMLR)☆29Nov 23, 2024Updated last year
- 明朝那些事儿☆11May 31, 2022Updated 4 years ago
- ☆11May 10, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆12Jul 15, 2022Updated 3 years ago
- ☆13Mar 14, 2023Updated 3 years ago
- [AAMAS 2023] Code for the paper "Automatic Noise Filtering with Dynamic Sparse Training in Deep Reinforcement Learning"☆12Feb 22, 2024Updated 2 years ago
- This repository accompanies the following paper: A Workflow for Offline Model-Free Robotic RL☆13Nov 5, 2021Updated 4 years ago
- Minimum Energy Resource Allocation Strategy with partial offloading☆10Jan 17, 2022Updated 4 years ago
- ☆15Jun 2, 2024Updated 2 years ago
- A custom mode for MapboxGL Draw to cut polygons☆20Dec 14, 2024Updated last year
- official pytorch implements for the experiments of paper "Rethinking Pretraining as a Bridge from ANNs to SNNs"☆11Jun 4, 2024Updated 2 years ago
- A real-time crypto-dashboard, Typescript, Node, Websockets, CryptoCompare API☆16Jan 6, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code to related to my NIPS 2016 paper☆10Dec 4, 2016Updated 9 years ago
- pytorch implementation of SAC, TD3 and TD7 with Mujoco Benchmark results from 4 seeds.☆15Jul 4, 2024Updated last year
- ☆10Nov 5, 2024Updated last year
- 基于TensorFlow实现LSTM对未来股价预测☆11Jun 23, 2018Updated 7 years ago
- Repository for the paper: "Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation" @ NeurIPS 2022☆21Jul 10, 2023Updated 2 years ago
- ☆15Aug 8, 2022Updated 3 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- ☆24Aug 9, 2022Updated 3 years ago
- An implementation of the paper ‘Channel Distribution Learning: Model-Driven GAN-Based Channel Modeling for IRS-Aided Wireless Communicati…☆16Oct 27, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for ICLR 2024 paper "When should we prefer Decision Transformers for Offline Reinforcement Learning?"☆17Jan 31, 2024Updated 2 years ago
- Research project for Deep Reinforcement Learning using Decision Transformer☆16May 12, 2023Updated 3 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated 2 years ago
- ☆14Nov 23, 2023Updated 2 years ago
- ☆20Sep 29, 2024Updated last year
- Scrape arbitrarily sized images from google streetview for basically free!☆26Dec 25, 2023Updated 2 years ago
- A LLM-powered agent for NetHack☆23Nov 4, 2024Updated last year
- Sketch to Heightmap Authoring Tool based on Deep Learning with pix2pix☆19Oct 28, 2020Updated 5 years ago
- Import et visualize geographic data☆20Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Simulation routines for "Cell-Free Massive MIMO for URLLC: A Finite-Blocklength Analysis", Alejandro Lancho, Giuseppe Durisi and Luca San…☆18Mar 29, 2023Updated 3 years ago
- Implementing the Vision Transformer paper from scratch for course project.☆12Apr 25, 2022Updated 4 years ago
- pytorch☆14Dec 11, 2020Updated 5 years ago
- Long Short-Term Memory Recurrent Neural Network for Traffic Prediction in Cellular Networks☆12Sep 28, 2020Updated 5 years ago
- Actor Prioritized Experience Replay☆19Nov 20, 2023Updated 2 years ago
- ☆15May 12, 2023Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Feb 9, 2023Updated 3 years ago