强化学习课程,主要是如何用强化学习解决问题
☆15Dec 10, 2024Updated last year
Alternatives and similar repositories for RLCourse
Users that are interested in RLCourse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a deadly simple repl for cpp with ghci style commands☆14Jan 7, 2023Updated 3 years ago
- The improved SSD is used to detect small targets in CCSDB dataset☆12Dec 15, 2020Updated 5 years ago
- 深入理解计算机系统 课程实验 / CMU 15213: CSAPP Labs☆12Dec 13, 2019Updated 6 years ago
- java实现跨域SSO单点登录 springboot + SSO + JWT☆13Jun 17, 2022Updated 3 years ago
- ☆13Oct 18, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The project covers common metrics for super-resolution performance evaluation.☆12Dec 27, 2021Updated 4 years ago
- Project for CVPR 21 paper: "Cross-MPI: Cross-scale Stereo for Image Super-Resolution using Multiplane Images"☆18Oct 8, 2021Updated 4 years ago
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 6 months ago
- This repo consists all my RL work and learnings☆12Dec 5, 2021Updated 4 years ago
- 整理所有特征工程用到的方法,为了复用☆11Jan 11, 2021Updated 5 years ago
- BASE-SQL: A powerful open source Text-To-SQL baseline approach☆13Feb 18, 2025Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Apr 21, 2024Updated 2 years ago
- 北京科技大学校园网自动登录程序☆12Jan 6, 2022Updated 4 years ago
- A Pytorch implementation of "An Efficient Unfolding Network with Disentangled Spatial-Spectral Representation for Hyperspectral Image Sup…☆13Jan 26, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- Node.js 我去图书馆自动预约脚本☆26Oct 27, 2021Updated 4 years ago
- Repo allows users to test different DL archictectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆20Mar 14, 2025Updated last year
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- ☆30Sep 24, 2024Updated last year
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- The pytorch code of hyperspectral image super-resolution method CST.☆17Sep 11, 2023Updated 2 years ago
- 基于deeplabv3plus网络实现了虹膜图像分割以及水果图像分割☆20Aug 28, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MCP servers collection☆10Feb 24, 2026Updated 2 months ago
- A simple lucky draw machine.☆24Jun 8, 2021Updated 4 years ago
- Rust ChatGPT CLI is a user-friendly terminal interface for OpenAI's GPT chatbot. Start conversations, manage ongoing chats, and send text…☆11May 9, 2023Updated 3 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 3 weeks ago
- 收集自网络的电视直播源及可以播放m3u、m3u8、flv、ts的Chrome扩展程序。☆65May 1, 2026Updated last week
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated last month
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated last year
- 一款虚拟交换机软件,可以把不在同一地点的主机接入一个虚拟局域网下。☆17Oct 16, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Sep 26, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Oct 9, 2023Updated 2 years ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- The python notebook is on googles new collabatory tool. Its a churn model being run on 3 different algorithms to compare.☆10Mar 3, 2018Updated 8 years ago
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated last year
- This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".☆10Aug 19, 2025Updated 8 months ago