Reinforcement learning materials
☆23Sep 5, 2019Updated 6 years ago
Alternatives and similar repositories for reinforcement_learning
Users that are interested in reinforcement_learning are comparing it to the libraries listed below.
- Test-Time Label-Shift Adaptation☆13May 24, 2023Updated 3 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Jan 12, 2022Updated 4 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train…☆11Nov 12, 2024Updated last year
- ☆17Mar 21, 2021Updated 5 years ago
- Adversarial Inverse Reinforcement Learning Implementation for Mountain Car☆36Sep 21, 2021Updated 4 years ago
- ☆10Dec 31, 2020Updated 5 years ago
- ☆23Jul 4, 2023Updated 2 years ago
- ECHO is a semi-supervised framework for classifying evolving data streams based on our previous approach SAND. The most expensive module …☆12Dec 25, 2017Updated 8 years ago
- Incremental and Adaptative Gaussian Mixture Model Library is an implementation of machine learning algorithms based on GMM. The algorit…☆13Mar 23, 2020Updated 6 years ago
- SAND: Semi-Supervised Adaptive Novel Class Detection and Classification over Data Stream☆17Dec 25, 2017Updated 8 years ago
- Code accompanying the paper "Off-Policy Primal-Dual Safe Reinforcement Learning"☆22Mar 29, 2024Updated 2 years ago
- Lark suite (Feishu) Linux client releases. Unofficial.☆10Jul 3, 2021Updated 4 years ago
- ☆14Feb 18, 2023Updated 3 years ago
- Stochastic state-space Inference and Prediction☆16Jun 1, 2026Updated last week
- AlphaHydrogen is an open source OpenAI Gym environment that simulates the energy system of a residential community with distributed renew…☆17Oct 5, 2021Updated 4 years ago
- cppad+Ipopt demo and OSQP demo programs☆13Nov 23, 2022Updated 3 years ago
- Density Ratio Estimation via Infinitesimal Classification (AISTATS 2022 Oral)☆22Mar 12, 2022Updated 4 years ago
- The code for the paper "An Online Method for A Class of Distributionally Robust Optimization with Non-Convex Objectives"☆13Oct 13, 2022Updated 3 years ago
- notes for NJU courses☆18Oct 26, 2021Updated 4 years ago
- This is a repository of PyTorch implementations of DQN and its variants, based on the original paper.☆13Nov 18, 2019Updated 6 years ago
- ☆10Feb 17, 2023Updated 3 years ago
- Chinese translation of "Reinforcement Learning: An Introduction" (2nd edition)☆56Jul 25, 2019Updated 6 years ago
- Grey-brick buildings, an open source data set of 225 calibrated Dutch residential building heat models, comprising identified models and …☆22Feb 2, 2023Updated 3 years ago
- Chinese translation of "Reinforcement Learning: An Introduction" (2nd edition)☆675Apr 9, 2022Updated 4 years ago
- ☆11Nov 2, 2021Updated 4 years ago
- TIP2022-BGDC-A Unified Pansharpening Model Based on Band-Adaptive Gradient and Detail Correction☆13Mar 7, 2025Updated last year
- kubernetes client to☆11May 27, 2022Updated 4 years ago
- Code for "Self-Sustaining Representation Expansion for Non-Exemplar Class-Incremental Learning"☆20Oct 26, 2022Updated 3 years ago
- C++11 base library built on folly, wangle, and proxygen☆11Apr 29, 2018Updated 8 years ago
- Artificial Intelligence and Deep Learning in Practice - TensorFlow volume (MD & Notebooks)☆13May 13, 2026Updated 3 weeks ago
- ☆10Apr 22, 2025Updated last year
- ☆14Sep 10, 2025Updated 8 months ago
- ☆11Jun 27, 2021Updated 4 years ago
- This repository helps you clone a voice to generate arbitrary speech in real time☆12Apr 24, 2020Updated 6 years ago
- ☆12Nov 6, 2024Updated last year
- Incorporating Neuro-Inspired Adaptability for Continual Learning in Artificial Intelligence☆28Dec 12, 2023Updated 2 years ago
- [NeurIPS 2020, Spotlight] Code for "Robust Deep Reinforcement Learning against Adversarial Perturbations on Observations"☆143Nov 16, 2021Updated 4 years ago
- ☆13Aug 15, 2022Updated 3 years ago
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago