A reinforcement learning course, focused mainly on how to solve problems with reinforcement learning
☆15Dec 10, 2024Updated last year
Alternatives and similar repositories for RLCourse
Users that are interested in RLCourse are comparing it to the libraries listed below.
- a deadly simple repl for cpp with ghci style commands☆14Jan 7, 2023Updated 3 years ago
- The improved SSD is used to detect small targets in CCSDB dataset☆12Dec 15, 2020Updated 5 years ago
- Computer Systems: A Programmer's Perspective course labs / CMU 15213: CSAPP Labs☆12Dec 13, 2019Updated 6 years ago
- Cross-domain SSO (single sign-on) implemented in Java: springboot + SSO + JWT☆13Jun 17, 2022Updated 3 years ago
- ☆13Oct 18, 2023Updated 2 years ago
- The project covers common metrics for super-resolution performance evaluation.☆12Dec 27, 2021Updated 4 years ago
- Project for CVPR 21 paper: "Cross-MPI: Cross-scale Stereo for Image Super-Resolution using Multiplane Images"☆18Oct 8, 2021Updated 4 years ago
- SimKO: Simple Pass@K Policy Optimization☆30Oct 24, 2025Updated 5 months ago
- This repo consists of all my RL work and learnings☆12Dec 5, 2021Updated 4 years ago
- A collection of all the methods used in feature engineering, organized for reuse☆11Jan 11, 2021Updated 5 years ago
- BASE-SQL: A powerful open source Text-To-SQL baseline approach☆13Feb 18, 2025Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Apr 21, 2024Updated last year
- Automatic login program for the University of Science and Technology Beijing campus network☆12Jan 6, 2022Updated 4 years ago
- A Pytorch implementation of "An Efficient Unfolding Network with Disentangled Spatial-Spectral Representation for Hyperspectral Image Sup…☆13Jan 26, 2023Updated 3 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- Node.js automatic reservation script for 我去图书馆 ("I'm Going to the Library")☆26Oct 27, 2021Updated 4 years ago
- Repo allows users to test different DL architectures when applied to time series forecasting of weather data (TCN, LSTM, BiLSTM, GRU, Bi…☆20Mar 14, 2025Updated last year
- Adaptive Machine Learning-Based Stock Prediction using Financial Time Series Technical Indicators☆10Dec 21, 2019Updated 6 years ago
- ☆30Sep 24, 2024Updated last year
- CPU Memory Compiler and Parallel programing☆26Nov 18, 2024Updated last year
- A Deep Reinforcement Learning model for high volume and frequency Forex Portfolio Management☆13Jan 11, 2023Updated 3 years ago
- The pytorch code of hyperspectral image super-resolution method CST.☆17Sep 11, 2023Updated 2 years ago
- Iris image segmentation and fruit image segmentation implemented with the deeplabv3plus network☆20Aug 28, 2020Updated 5 years ago
- A simple lucky draw machine.☆24Jun 8, 2021Updated 4 years ago
- Rust ChatGPT CLI is a user-friendly terminal interface for OpenAI's GPT chatbot. Start conversations, manage ongoing chats, and send text…☆11May 9, 2023Updated 2 years ago
- Live TV stream sources collected from the web, plus a Chrome extension that can play m3u, m3u8, flv, and ts.☆58Apr 11, 2026Updated last week
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Updated this week
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- RapidIn: Scalable Influence Estimation for Large Language Models (LLMs). The implementation for paper "Token-wise Influential Training Da…☆21Mar 10, 2026Updated last month
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated 11 months ago
- Virtual switch software that connects hosts in different locations to a single virtual LAN.☆17Oct 16, 2022Updated 3 years ago
- ☆12Sep 26, 2021Updated 4 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- A general framework used on evaluating the performance of large language models (LLMs) based on the peer review mechanism among LLMs☆19Aug 3, 2024Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Oct 9, 2023Updated 2 years ago
- The Python notebook is on Google's new Colaboratory tool. It's a churn model run on 3 different algorithms for comparison.☆10Mar 3, 2018Updated 8 years ago
- The official implementation of SRM-Hair.☆35Mar 11, 2025Updated last year
- [ICLR 2025] Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron☆30Apr 30, 2025Updated 11 months ago
- This is the official code for our paper entitled "Dynamic Deep Factor Graph for Multi-Agent Reinforcement Learning".☆10Aug 19, 2025Updated 8 months ago